Few molecular events important in platelet biogenesis have been
identified. Mice homozygous for the recessive mutation gunmetal (gm) have
platelet and megakaryocyte defects that cause prolonged bleeding and high
mortality; gm platelets are increased in size, decreased in abundance, lack
α- and δ-granule proteins, and contain abnormal small GTPases; gm
megakaryocytes are increased in abundance and have abnormal intracellular
membrane accumulations. The main goal of this dissertation work was to
identify the gene responsible for the gm mouse mutant using a positional cloning
strategy. Positional cloning is an approach to disease gene identification based
primarily on genetic location. The steps involved include: (1) genetic localization;
(2) physical contig assembly; (3) candidate gene identification; (4) mutation
analysis; and (5) characterization of the gene mutation.

Using 2462 gm backcross mice, we localized gm to a 0.64 centiMorgan
(cM) region of mouse chromosome (Chr) 14. Using direct cDNA selection, the
α-subunit of Rab geranylgeranyl transferase (Rabggta) was isolated from
contiguous yeast artificial chromosome (YAC) and bacterial artificial
chromosome (BAC) clones that contained this region. In gm mice, DNA
sequence  analysis  of  Rabggta  identified  a G ->  A substitution  at  residue  -1  of  the 
intronic  splice  acceptor  site  preceding  exon  1 (which  contains  the  start  codon). 
Most  gm  Rabggta  mRNAs  lacked  the  start  codon  due  to  exon  1 skipping,  while  a 
minute  amount  retained  the  start  codon  due  to  the  utilization  of  cryptic  splice 
sites. Rab geranylgeranyl transferase (Rab GGTase, composed of α, β, and
REP-1  components)  covalently  attaches  lipid  moieties  to  the  C-terminus  of  Rab 
GTPases,  enabling  reversible  attachment  to  membranes.  Rabs  regulate 
intracellular  vesicular  transport  by  acting  as  molecular  switches  that  oscillate 
between  membrane-attached,  GTP-bound  and  cytosolic,  GDP-bound  states. 
Levels  of  Rabggta  protein  and  Rab  GGTase  activity  were  reduced  4-fold  in  gm 
platelets.  Prenylation  and  membrane  association  of  a Rab  GGTase  substrate, 
Rab27a,  was  also  decreased  in  gm  platelets.  These  data  indicate  that 
prenylation  of  Rab  GTPases  is  important  for  platelet  synthesis  and  delivery  of 
proteins  to  platelet  granules,  and  suggest  Rab  GGTase  as  a therapeutic  target 
for  thrombocytosis  and  clotting  disorders. 

CHAPTER  1 
INTRODUCTION 


Despite  numerous  studies,  relatively  few  molecular  mechanisms  or 
biochemical  pathways  regulating  platelet  production  by  megakaryocytes  are 
known.  This  is  a significant  deficiency,  since  inherited  and  acquired  platelet 
defects  are  not  uncommon,  and  currently  are  treated  with  repeated  platelet 
transfusions,  which  are  very  costly,  may  transmit  infectious  diseases,  and  may 
result  in  alloimmunization.^'^  Identification  of  the  molecular  basis  of  Mendelian 
mouse  models  of  platelet  defects  is  an  effective  way  to  identify  new  and 
important  pathways  or  factors  involved  in  platelet  production  from 
megakaryocytes. 

Megakaryocytes  and  Platelets 

Hematopoiesis  involves  the  differentiation  of  a bone  marrow  pluripotent 
stem  cell  into  various  cellular  components  of  the  blood.  The  megakaryoblast  is 
one  of  the  cell  types  that  differentiates  from  the  stem  cell.  With  the  aid  of  certain 
signals,  this  cell  gives  rise  to  the  megakaryocyte  and  ultimately  to  2000-8000 
platelets  (thrombocytes).^'^ 
Megakaryocyte maturation is marked by an increase in size, development
of organelles, and acquisition of platelet proteins such as glycoprotein IIb/IIIa,
thrombospondin (TSP), von Willebrand factor (vWF), and platelet factor 4 (PF4).
The following outlines the stages and factors involved in megakaryocytopoiesis:

Stages of Megakaryocytopoiesis^

• BFU-MK:  The  burst-forming  unit-megakaryocyte  is  the  most  undifferentiated 
committed  megakaryocyte  progenitor  cell.  It  has  great  proliferative  potential 
and  can  divide  to  produce  many  CFU-MKs. 

• CFU-MK:  The  colony  forming  unit-megakaryocyte  is  a more  differentiated 
progenitor cell than BFU-MK, but has low proliferative potential.

• Megakaryoblast:  The  maturational  events  of  megakaryocytopoiesis  are 
thought  to  start  with  this  mononuclear  cell  type.  These  cells,  representing 
~5%  of  all  megakaryocytic  elements  in  the  marrow,  are  characterized  by  their 
small  size  and  round  morphology.  They  also  express  phenotypic  markers 
belonging to the megakaryocyte, such as proteins contained in the α-granule
and membrane glycoproteins.^

• Megakaryocyte:  This  is  the  final  developmental  stage.  As  megakaryocytes 
proliferate,  they  enlarge  and  undergo  extensive  DNA  replication  without 
mitosis.  The  polyploid  cells  subsequently  undergo  endomitosis  to  form 
multilobed  nuclei  with  four  to  64  times  the  haploid  amount  of  DNA,  most 
commonly  16  or  32N.''  The  mature  megakaryocyte  is  located  directly 
adjacent to bone marrow sinusoidal endothelial cells. As extensive
demarcation membranes form by invaginations of the plasma membrane,
megakaryocytes  develop  long  filopodia  that  directly  penetrate  the  endothelial 
cytoplasm  and  extend  into  the  marrow  capillaries.®  These  projections  then 
fragment  to  produce  mature  thrombocytes  (platelets).  More  than  2000 
platelets  are  ultimately  produced  from  the  cytoplasm  of  a single  mature 
megakaryocyte.'* 

Factors Involved in Megakaryocytopoiesis*

• Proliferative/Growth  factors  = thrombopoietin  (TPO),  IL-3,  & GM-CSF 

• Maturative/Differentiation factors (cell compartment factors) = TPO, IL-6, & IL-11

• Inhibitory  factors  = platelet  factor  4 (PF4) 

Platelets  are  very  small,  anucleate  cells  that  contain  localized 
concentrations  of  molecules  required  for  hemostasis,  the  stopping  of  a flow  or 
circulation  of  blood.  Platelets  are  heterogeneous  in  size  and  reactivity,  and  large 
platelets  are  more  hemostatically  active  than  small  platelets.®  Platelets  must 
undergo  three  processes  for  hemostasis  to  occur:  adhesion  to  exposed  collagen 
in  blood  vessels;  release  of  the  contents  of  their  granules;  and  aggregation,  as 
described  below.^ 

1. Platelet adhesion to collagen on the damaged vessel wall is the first step
in  hemostasis  and  thrombosis,  the  development  of  a clot.  This  activation 
process  is  mediated  by  von  Willebrand’s  factor  (vWF),  a glycoprotein 
secreted  by  endothelial  cells  and  platelets  into  plasma.  The  vWF  protein 
acts  as  a secure  bridge  between  a glycoprotein  complex  on  platelet 
surfaces  and  collagen  fibrils  in  the  walls  of  blood  vessels.  It  thus  prevents 
platelets  from  being  detached  from  vessel  walls  by  high  shearing  forces 
developed  in  the  blood  vessels.® 

2.  Once  the  platelets  are  activated  they  change  shape  and  release  the 
contents  of  their  alpha  and  dense  granules  to  increase  activation  and 
promote aggregation. The alpha (α) granules are distinct storage
organelles containing a plethora of proteins, such as vWF, fibronectin, and
thrombospondin. When released, these proteins take part in platelet
reactions during adhesion, aggregation, and clot formation.® Dense (δ)
granules differ from α-granules in that their electron-dense contents may
fill the whole or the central matrix of the organelle. The components of
δ-granules (ADP, Ca2+, and serotonin) aid in platelet aggregation and
transmembrane signaling. Granule release reaction disorders are known
as storage pool diseases (SPDs). SPDs of the α-granules include gray
platelet syndrome, while SPDs of the δ-granules include Hermansky-Pudlak
syndrome and Chediak-Higashi syndrome.^® SPD of both α- and
δ-granules together (alpha/delta SPD) has also been reported.”

3.  Increased  platelet  activation  with  thrombin  and  ADP  allows  interplatelet 
contact  and  the  formation  of  platelet  aggregates.  ADP  modifies  the 
platelet  surface  so  that  fibrinogen  will  attach  to  a complex  of  two 
glycoproteins  on  the  platelet  surface.  Molecules  of  fibrinogen  then  link 
adjacent  platelets  to  each  other,  forming  a platelet  aggregate.^^  The 
aggregate  structure  results  in  an  impermeable  multicellular  thrombocyte 
thrombus  (clot)  that  eventually  releases  tissue  plasminogen  activator 
(TPA)  to  prevent  complete  blockage  of  the  vessel.® 

The  above  processes,  megakaryocytopoiesis  and  thrombopoiesis,  are 
thought  to  be  under  the  control  of  a feedback  mechanism  involving  platelet 
contents  as  the  source  of  inhibitory  factors  for  megakaryocyte  progenitor  cell 
growth. As  platelet  numbers  increase,  inhibition  of  megakaryocyte  progenitor 
cell  growth  increases,  resulting  in  fewer  megakaryocytes  and  thus  a decrease  in 
platelets.  A low  platelet  count  would  reduce  megakaryocyte  inhibitory  factors, 
and  thus  megakaryocyte  numbers  would  increase,  resulting  in  more  platelets. 

Gunmetal 

A single-gene  mutation  in  the  mouse  that  affects  platelet  production  is  the 
pigment mutation gunmetal (gm). It is an autosomal recessive mutation that
arose spontaneously in the C57BL/6J strain in 1962.^'* It was initially localized to
central mouse Chromosome 14.’® Gunmetal homozygotes (gm/gm) display a
syndrome  of  partial  albinism  and  platelet  defects  that  is  associated  with 
prolonged  bleeding,  high  mortality,  and  poor  fecundity. 

Characteristics of gm/gm Megakaryocytes and Platelets

Gunmetal  homozygotes  suffer  from  macrothrombocytopenia,  a decrease 
in  number  and  increase  in  size  of  blood  platelets  (Figure  1.1).  gm/gm  mice  have 
only  40%  of  the  normal  platelet  count,  whereas  gm/+  mice  have  a normal  platelet 
count.  This  deficiency  is  due  to  a 70%  reduction  in  platelet  synthesis  in  gm 
homozygotes  compared  to  normal  mice.^®  gm/gm  mice  also  exhibit  abnormal 
heterogeneity  in  platelet  size.  Many  of  the  platelets  observed  in  gunmetal  mice 
are  significantly  larger  than  in  normal  mice. 


Figure 1.1: Transmission electron micrographs of gm/+ and gm/gm platelets
(adapted from Novak et al., 1995). (A) The gm/+ mouse platelets are normal in
size, shape, and appearance. OM x 16,000. (B) The gm/gm mouse platelets
show greater size heterogeneity, and some contain membrane complexes
(arrow). OM x 16,000. DB, dense body; αG, alpha granule.

Gunmetal homozygotes display several dense granule abnormalities, thus
laying the foundation for a mild δ-SPD. The gm/gm serotonin component of
dense granules is only 50% of normal.^® Additionally, the number of dense
granules per gm/gm platelet is also 50% of normal.^® Because of the larger
volume of gm/gm platelets, the above are minimal estimates.

Gunmetal homozygotes exhibit numerous alpha-granule abnormalities,
the basis of a severe α-SPD. Three α-granule components (fibrinogen, vWF,
and PF4) in platelets from gm/gm mice show significant quantitative reductions
on Western blot when compared with normal mouse platelets.^® These
α-granule abnormalities suggest that gm may be an animal model for human gray
platelet syndrome (see below).

Gunmetal homozygotes demonstrate an abnormal megakaryocyte count.
A four-fold increase in megakaryocyte count was observed in gm/gm mice.^^
This could be the result of a feedback mechanism responding to the decreased
platelet production.

Gunmetal  homozygotes  show  signs  of  abnormal  emperipolesis,  a process 
in  which  other  cells  traffic  through  the  cytoplasm  of  the  megakaryocyte  (Figure 
1.2).  This  event  is  greatly  increased  in  gunmetal  homozygotes  compared  to 
normal  mice.^^  It  has  been  suggested  that  chemotactic  factors  released  from 
leaky mutant megakaryocyte α-granules recruit other cells within the mutant
megakaryocyte cytoplasm. This is likely because α-granule components are
reduced in platelets but are increased in plasma of gm/gm mice.^^

Figure 1.2: Transmission electron micrograph of a gm/+ and gm/gm mouse
megakaryocyte  (adapted  from  Novak  et  al.,  1995).  In  contrast  with  the  (A)  gm/+ 
normal  megakaryocyte,  the  (B)  gm/gm  mutant  megakaryocyte  contains  zones  of 
membrane  complexes  (DMS,  arrows)  and  an  emperipolesed  leukocyte  (asterisk). 
The  inset  (B,  top  left)  shows  another  cell  from  the  same  specimen  that  contains 
17  emperipolesed  cells.  OM  x 7,000;  inset  OM  x 1,900;  N,  nucleus. 

Gunmetal homozygotes display abnormal expression of two platelet
low-molecular-weight GTP-binding proteins (Figure 1.3). Normal mouse platelets
express 27.5, 26, & 23-Kd GTP-binding proteins. gm/gm mouse platelets
express  these  three  GTP-binding  proteins  as  well  as  two  novel  (28.5  & 25-Kd) 
GTP-binding  proteins.^®  The  expression  of  these  novel  GTP-binding  proteins  is 
platelet  specific  and  genetically  recessive  in  nature.^®  GTP-binding  proteins  have 
been  implicated  in  platelet  formation  due  to  their  regulatory  role  in  membrane 
budding  and  fusion  reactions.®'^®'^® 


Figure  1.3:  Abnormal  expression  of  LMW  GTP-binding  proteins  in  platelets  of 
gunmetal  mice  (adapted  from  Swank  et  al.,  1993).  Platelet  extracts  from  gm/gm, 
gm/+,  and  +/+  mice,  equivalent  in  protein,  were  electrophoresed  on  SDS  gels 
and  Western-blotted  to  nitrocellulose  membranes  that  were  incubated  with 
[32P]GTP followed by autoradiography. The arrows indicate the 28.5 and 25.0-Kd
GTP-binding proteins specific to gm/gm platelets.

Conclusions about the gm phenotype are as follows: (1) The
macrothrombocytopenia suggests a low rate of platelet synthesis; (2) The
reduction of several α-granule components in platelets suggests a general effect
on organelle biogenesis rather than on specific α-granule components; (3) The
high megakaryocyte count suggests a feedback mechanism is trying to
counteract the low rate of platelet synthesis; (4) The deficiency of several
α-granule proteins in the platelet α-granule and their appearance in plasma
suggests an abnormality in targeting or retention of α-granule proteins rather
than a defect in synthesis; (5) An abnormality in targeting could also account for
the  abnormal  emperipolesis  in  gm  megakaryocytes;  (6)  The  novel  platelet  GTP- 
binding  proteins  may  account  for  the  platelet  production  deficiency  based  on  the 
platelet  activating  capabilities  of  some  GTP-binding  proteins;  (7)  Additional 
evidence  that  shows  reciprocal  bone  marrow  transplantation  between  gm/+  and 
gm/gm  either  corrected  or  caused  abnormal  platelet  size  and  bleeding  times/^ 
suggests  a defect  in  a marrow  progenitor  cell  is  responsible  for  the  hematological 
defects  associated  with  gunmetal  mice. 

Collectively,  the  characteristics  of  gm  megakaryocytes  and  platelets 
suggest  that  the  gunmetal  gene  acts  intrinsically  in  megakaryocytes  and  that  an 
abnormality in this gene causes significant qualitative and quantitative effects on
platelet production.

Gray Platelet Syndrome

The most typical α-SPD in humans is gray platelet syndrome (GPS).^° Mild
mucocutaneous bleeding tendency, moderate thrombocytopenia, large platelets,
and incomplete aggregation responses to collagen, thrombin, or Ca2+
characterize this autosomal recessive human disorder.^® Ultrastructural and
cytochemical studies have shown that the platelets contain a normal number of
dense granules but specifically lack α-granules. The platelet α-granule content
(PF4, fibrinogen, TSP, fibronectin) is markedly decreased.^® GPS
megakaryocytes are also deficient in α-granules; however, plasma levels of
α-granule proteins are increased. This suggests that GPS megakaryocytes can
synthesize α-granule proteins normally but cannot package them into precursors
of the α-granule.^®

Among  inherited  human  bleeding  diseases,  gunmetal  is  most  similar  to 
GPS. Both have deficiencies of platelet α-granule proteins, reduced platelet
numbers, and enlarged platelet volumes.^® The appearance of platelet α-granule
proteins in plasma is likewise common to both. GPS patients usually have a
more marked deficiency of α-granule components and do not exhibit the δ-SPD
seen in gm mice. However, the apparent δ-SPD difference between GPS and
gm may be artifactual because δ-SPD in gunmetal is mild (only a twofold
decrease  in  dense  granule  content)  and  because  a similar  twofold  variation  in 
platelet  dense  granule  content  is  found  among  human  controls.^® 

The  Aim 

Therefore,  the  main  objective  of  this  dissertation  work  was  to  positionally 
clone  and  characterize  the  gene  responsible  for  the  mouse  mutation  gunmetal, 
in  order  to  gain  insights  into  the  biological  processes  involving 
megakaryocytopoiesis  and  thrombopoiesis.  After  the  construction  of  two  gm 
genetic  backcrosses,  the  following  specific  aims  for  the  present  dissertation  were 
determined  (see  flow  diagram,  Figure  1.4): 

Specific Aim 1: Create a Precise Genetic Map of Mouse Chr 14 in Vicinity of gm

DNA  from  three  large  gm  interspecific  backcrosses  will  be  PCR-amplified 
with  polymorphic  microsatellite  markers  (SSLP)  to  localize  gm  within  an  ~0.1cM 
interval  on  central  mouse  Chr  14. 

Specific Aim 2: Generate a Physical Map Spanning the gm Critical Region

A yeast  artificial  chromosome  (YAC)  contig,  with  several-fold  redundancy, 
of  the  gm  nonrecombinant  interval  will  be  constructed  by  screening  YAC  libraries 
using genetic markers tightly linked to gm. New sequence-tagged sites (STSs)
will be derived if necessary and used with the initial genetic markers to
STS-content map YAC clones.

Specific Aim 3: Identify gm Candidate Genes Based on Location, Function, and
Expression

cDNA  selection  will  be  used  to  isolate  genes  from  YACs  spanning  the  gm 
critical  region.  These  genes  will  be  examined  for  candidacy  for  gm  based  on 
genetic  location,  known  function,  and  tissue  distribution  of  expression. 

Specific Aim 4: Screen gm Candidate cDNAs for Mutations

Genes  considered  favorable  candidates  for  gm  will  be  screened  for 
mutations  using  RT-PCR,  Northern  blots,  Southern  blots,  and  sequence  analysis 
from  gm  and  normal  mice. 

Specific Aim 5: Characterize the gm Gene

Intense  characterization  of  the  gm  gene  requires  full-length  cDNA.  gm 
cDNA  clones  will  be  isolated  from  bone  marrow  cDNA  libraries  until  a full-length 
clone  is  assembled  and  sequenced.  The  full-length  mouse  clone  will  be  used  to 
isolate  and  map  the  homologous  human  gene.  Molecular  characterization  of  the 
gm  gene  will  aid  in  the  understanding  of  inherited  human  platelet  disorders  such 
as  gray  platelet  syndrome. 


Figure  1.4:  Flow  diagram  that  summarizes  the  strategy. 




The  Significance 

Platelet  organelles  and  enzyme  systems  are  created  in  the 
megakaryocyte,  and  changes  in  platelet  volume  and  reactivity  are  correlated  with 
and  preceded  by  changes  in  the  megakaryocyte.^  Therefore,  perturbations  of 
megakaryocyte  physiology  may  have  important  pathological  implications  for  the 
development  of  atherosclerosis  and  thrombosis,  which  remain  a major  cause  of 
morbidity  and  mortality  in  the  Western  world.  Identification  and  analysis  of  the 
gunmetal  gene  is  a unique  opportunity  to  learn  about  new  pathways  and  proteins 
involved  in  these  processes.  This  project  uses  the  power  of  mouse  genetics  and 
the  fortuitous  gm  mutation  to  increase  our  understanding  of  megakaryocyte 
physiology.  A better  understanding  of  the  biogenesis  of  platelet  alpha  and  dense 
granules  will  contribute  to  treatments  and  cures  for  some  of  these  devastating 
blood  disorders  by  identifying  affected  biochemical  pathways,  thereby  providing 
new  and  improved  therapeutic  targets. 

Initial  Studies  Toward  First  Specific  Aim 

At  the  outset  of  our  study,  a precise  genetic  map  in  the  vicinity  of  gm  had 
been  generated  with  the  aid  of  our  collaborators  (Dr.  Richard  Swank,  Roswell 
Park  Cancer  Inst.,  Buffalo,  NY).^°  Two  large  interspecific  backcrosses  were 
produced  in  an  attempt  to  maximize  the  chance  of  finding  polymorphism  in  the 
gm  region.  The  progeny  were  typed  with  many  polymorphic  microsatellites  and 
genes in the vicinity of gm (Figure 1.5). gm was localized to central mouse Chr
14  by  minimizing  single  crossover  events  and  eliminating  double  crossover 
events.^’  The  results  are  described  below. 
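
The ordering principle just described (prefer the locus order that minimizes single crossovers and avoids double crossovers) can be sketched in a few lines. This is only an illustration under stated assumptions, not the procedure used in this work: the marker names M1 and M2, the genotype codes ('B' for homozygous C57BL/6J, 'H' for heterozygous), and the four progeny are hypothetical.

```python
from itertools import permutations

# Hypothetical backcross progeny typed at two markers and at gm.
# 'B' = homozygous C57BL/6J, 'H' = heterozygous (assumed coding).
PROGENY = [
    {"M1": "B", "gm": "B", "M2": "B"},  # nonrecombinant
    {"M1": "H", "gm": "H", "M2": "H"},  # nonrecombinant
    {"M1": "H", "gm": "B", "M2": "B"},  # crossover between M1 and gm
    {"M1": "B", "gm": "B", "M2": "H"},  # crossover between gm and M2
]


def count_crossovers(progeny, order):
    """Total genotype switches between adjacent loci for a given locus order."""
    total = 0
    for animal in progeny:
        genotypes = [animal[locus] for locus in order]
        total += sum(a != b for a, b in zip(genotypes, genotypes[1:]))
    return total


def best_order(progeny, loci):
    """Return the locus order needing the fewest crossovers (brute force)."""
    return min(permutations(loci),
               key=lambda order: count_crossovers(progeny, order))


order = best_order(PROGENY, ["M1", "gm", "M2"])
print(order, count_crossovers(PROGENY, order))
```

Any order that places gm outside the two markers forces one of the toy animals to carry a double crossover, so it scores worse. Brute-force enumeration is only practical for a handful of loci.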

Figure  1.5:  Preliminary  gm  genetic  mouse  maps. 

Cross #1 Results: [(C57BL/6J-gm x PWK)F1 x C57BL/6J-gm] = ~750 progeny^"

• gm is localized to a 1.5cM interval.

• gm  is  flanked  proximally  by  Myhca  and  distally  by  D14Mit122. 

• No  markers  cosegregate  with  gm. 

• This  cross  appears  to  have  undergone  a distance  expansion  (recombination 
amplification)  and  polymorphism  decrease.  The  few  markers  that  are 
polymorphic  in  this  cross  have  genetically  larger  distances  between  them, 
compared  to  cross  #2  and  other  published  mapping  results.  Thus,  this  region 
may  be  a hot  spot  for  recombination  in  the  PWK  mouse  strain. 

Cross #2 Results: [(C57BL/6J-gm x SPRET)F1 x C57BL/6J-gm] = ~300 progeny

• gm  is  localized  to  a 0.9cM  interval. 

• gm is flanked proximally by D14Mit63 and distally by D14Mit5 & 65.

• D14Mit83,  84,  122,  215,  & 261  cosegregate  with  gm  in  300  meioses. 

• This cross appears to have undergone a distance compression
(recombination suppression) and polymorphism increase. Many more
markers are polymorphic in this cross than in cross #1; however, most are not
separable by genetic mapping. This region may be a recombination cold spot
in the SPRET mouse strain.
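
The interval sizes and cosegregation results listed for the two crosses rest on simple arithmetic, sketched here under stated assumptions: each backcross animal counts as one informative meiosis, map distance is approximated as the percentage of recombinants (reasonable for small intervals), and the bound for zero observed recombinants comes from a binomial argument. The function names and the example recombinant count are illustrative and are not taken from this work.

```python
def map_distance_cm(recombinants, meioses):
    """Approximate map distance (cM) as percent recombinants among meioses."""
    if meioses <= 0:
        raise ValueError("meioses must be positive")
    return 100.0 * recombinants / meioses


def upper_bound_cm_no_recombinants(meioses, confidence=0.95):
    """Upper confidence bound (cM) on distance when 0 recombinants are seen.

    Solves (1 - r) ** meioses = 1 - confidence for the recombination
    fraction r, then converts r to a percentage.
    """
    if meioses <= 0:
        raise ValueError("meioses must be positive")
    r = 1.0 - (1.0 - confidence) ** (1.0 / meioses)
    return 100.0 * r


# Hypothetical example: 3 recombinants among 300 backcross progeny.
print(map_distance_cm(3, 300))

# Markers cosegregating with gm in 300 meioses could still lie this far away:
print(round(upper_bound_cm_no_recombinants(300), 2))
```

On this reasoning, about 300 meioses resolve nonrecombinant markers only to within roughly 1 cM of gm, so finer localization depends on typing many more progeny.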

The  two  gm  crosses  mentioned  above  have  failed  to  determine  an 
accurate  genetic  location  of  gm.  The  lack  of  polymorphic  markers  in  the  PWK 
cross  has  prohibited  the  mapping  of  many  markers,  although  the  ones  that  have 
been  polymorphic  were  informative  in  localizing  gm  either  proximal  or  distal  to 
the  marker.  The  SPRET  cross  has  provided  many  polymorphic  markers. 
However,  the  lack  of  recombination  between  B6  & SPRET  in  the  gm  region  is 
such that the markers all segregate together, and thus gm cannot be localized
relative  to  many  of  them.  With  this  starting  point,  further  genetic  mapping  and 
positional  cloning  techniques  were  undertaken  to  clone  the  gunmetal  gene,  as 
described  in  the  following  chapters. 


CHAPTER  2 

IDENTIFICATION OF NOVEL SIMPLE SEQUENCE LENGTH
POLYMORPHISMS (SSLP) IN MOUSE BY INTERSPERSED REPETITIVE
ELEMENT (IRE)-PCR


Introduction 

To  address  the  need  for  additional  markers  in  the  gm  region,  the  method 
described  in  this  chapter  was  developed  and  employed.^^  Positional  cloning  is 
an  important  method  of  disease  gene  identification  in  human  and  in  model 
organisms  such  as  mouse.  This  approach  is  based  on  gene  identification  from 
contiguous  genomic  clones  spanning  a genetically  defined  interval  that  contains 
a locus  of  interest.^^  A prerequisite  of  positional  cloning  is  the  creation  of  dense 
genetic  and  physical  maps  for  the  chromosomal  region  of  interest.  One  of  the 
most informative and widely used types of genetic marker is the simple
sequence  length  polymorphism  (SSLP),  also  known  as  polymorphic 
microsatellite  markers.^'*  Such  markers  are  made  up  of  serial  di-,  tri-,  tetra-  or 
penta-nucleotide  repeats,  where  the  number  of  repeat  units  differs  among 
individuals/strains  within  a species.  Microsatellite  markers  are  easily  amplified 
by  PCR  and  exhibit  strain-specific  polymorphisms  upon  observation  on  agarose 
or  polyacrylamide  gels.  However,  for  most  regions  of  the  mouse  genome, 
current  SSLP  density  is  insufficient  for  positional  cloning  approaches. 
Conventional  approaches  to  the  identification  of  novel  SSLPs  from  a genomic 
region,  such  as  subcloning  genomic  regions  and  hybridization  with  repeat 
probes,  are  cumbersome. 
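
As an illustration of what a simple sequence repeat looks like at the sequence level, the sketch below scans a DNA string for tandem di- to penta-nucleotide repeats. It is not part of the method described in this chapter; the function name, the minimum of four repeat units, and the toy sequence are assumptions made for the example.

```python
import re


def find_simple_repeats(sequence, min_repeats=4):
    """Find tandem repeats of 2-5 nt units repeated at least min_repeats times.

    Returns a list of (unit, repeat_count, start_index) tuples.
    """
    # Group 1 is the repeat unit; \1{n,} demands n further tandem copies.
    pattern = re.compile(r"([ACGT]{2,5}?)\1{%d,}" % (min_repeats - 1))
    hits = []
    for match in pattern.finditer(sequence.upper()):
        unit = match.group(1)
        if len(set(unit)) == 1:
            continue  # skip homopolymer runs such as AAAAAAAA
        hits.append((unit, len(match.group(0)) // len(unit), match.start()))
    return hits


# Hypothetical sequence containing a (CA)8 and a (CCAAA)4 repeat.
toy = "TTGACCTG" + "CA" * 8 + "GGTCATTC" + "CCAAA" * 4 + "TGCATG"
print(find_simple_repeats(toy))
```

Unique sequence on either side of each hit is where locus-specific primers would be placed, so that amplicon length reports the number of repeat units.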

Interspersed  repetitive  element  (IRE)-PCR  is  a very  useful  method  for 
identification  of  novel  human  or  mouse  sequence  tagged  sites  (STSs)  from 
contigs  of  genomic  clones  or  somatic  cell  hybrids.^®'^®  This  technique  achieves 
selective  amplification  of  mammalian  DNA  from  an  admixture  with  yeast  or 
prokaryotic  DNA,  without  prior  knowledge  of  the  mammalian  DNA  sequence,  by 
PCR  using  outwardly  oriented  primers  that  are  specific  for  mammalian  high  copy 
number  repetitive  elements.  For  example,  IRE-PCR  of  mouse  genomic  DNA 
may  be  accomplished  using  primers  designed  from  the  5’  and  3’  ends  of  the  B1 
repetitive  element  DNA  sequence,  which  belongs  to  the  short  interspersed 
element  (SINE)  family  (Figure  2.1).  The  130-150  bp  B1  repetitive  element,  which 
is  believed  to  be  homologous  to  the  human  ‘Alu’  repeat,  is  present  at  a copy 
number of 130,000-180,000 per haploid mouse genome,^® and frequently is
located at intron-exon junctions within genes.

^ 5’  B1  Reverse 

CCGGGCATG  GTGGTGCATG  CCTTTAATCC  CAGCACTCGG 
GAGGCAGAGG  CAGGCGGATT  TCTGAGTTCG  AGGCCAGCCT 
GGTCTTCAGA  GTGAGTTCCA  GGACACCAGG  GCTACAGAGA 
AACCCTGTCT 

Figure  2.1:  Consensus  sequence  of  the  B1  repetitive  element.^®  Primers  used 
for  the  IRE-PCR  are  identified  above  the  sequence  with  arrows.^^ 
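
The idea of outwardly oriented primers can be made concrete with a short sketch. It assumes a hypothetical 40-nt repeat element and a 10-nt primer length chosen only for illustration; it does not reproduce the B1 primers actually used, whose sequences are not given in the text.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def outward_primers(element, length=10):
    """Return (five_prime_reverse, three_prime_forward), each written 5'->3'.

    The 5' reverse primer is the reverse complement of the element's first
    bases, so it extends away from the element into upstream flanking DNA.
    The 3' forward primer matches the element's last bases, so it extends
    into downstream flanking DNA.
    """
    if len(element) < 2 * length:
        raise ValueError("element is too short for two primers of this length")
    five_prime_reverse = reverse_complement(element[:length])
    three_prime_forward = element[-length:].upper()
    return five_prime_reverse, three_prime_forward


# Hypothetical repeat element (not the B1 consensus).
TOY_ELEMENT = "ACGTTGCAAGGCTTAACCGGTTAGCATCGATCGGATCCAA"
print(outward_primers(TOY_ELEMENT))
```

Because both primers point away from the element, a product forms only when a second copy of the repeat lies nearby in a suitable orientation, which is what makes the amplification specific to inter-repeat DNA.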




Materials  and  Methods 

As  described  below,  B1-B1  IRE-PCR  (Figure  2.2)  was  exploited  to  identify 
novel  polymorphic  SSLPs  from  genomic  clones  containing  mouse  DNA. 

Genomic  DNA  from  4 different  mouse  chromosome  14  YAC  clones  (63B10, 
148G4,  183B10,  and  200A8)  was  isolated  as  part  of  a positional  cloning  effort 
(see  Chapter  4)  and  used  as  the  template  for  three  separate  IRE-PCR  reactions. 


[Flow diagram: Mouse YAC clone -> PCR with B1 primers -> Library of inter-B1 products -> Clone en masse & sequence -> PCR amplicon = novel STS -> Use to assess YAC contiguity & isolate additional YACs]

Figure  2.2:  Flow  diagram  of  the  IRE-PCR  method.^^ 

The DNA was amplified with a 5’ B1 reverse primer alone, a 3’ B1 forward
primer alone, and a combination of 5’ B1 reverse and 3’ B1 forward primers
together (Figure 2.1).^ The PCR master mixes contained (for one 50 µl
reaction): 50 ng of yeast genomic DNA (containing a YAC), 1 µM of each primer,
200 µM of each dNTP, 1x PCR buffer, and 2.5 units of Taq DNA polymerase
(Boehringer Mannheim, Indianapolis, IN). The PCR amplifications were
performed in a PTC-100 Programmable Thermal Controller (MJ Research,
Watertown, MA). The conditions involved an initial 2 min denaturation at 95°C;
then 33 cycles of 95°C for 20 sec, 55°C for 20 sec, and 72°C for 90 sec; and
finally a 10 min polishing extension at 72°C. Upon amplification, an aliquot of
each reaction was electrophoresed on a 2% agarose gel to ensure amplicon
(PCR product) complexity of at least 10 readily identified bands. Using these
conditions, amplicons varied in size between 200 bp and 2500 bp. Following
purification of the remaining 45 µl of PCR reaction (Wizard kit, Promega,
Madison, WI), the three IRE-PCR products from each YAC were admixed and 6
µl of each pool was cloned en masse (TA cloning kit, Invitrogen, San Diego, CA).
Twenty-two recombinant colonies were picked and grown in liquid culture,
plasmid DNA was isolated (Wizard Plus Miniprep, Promega), and inserts were
end-sequenced with M13 vector primers using an ABI PRISM 310 Genetic Analyzer
and appropriate cycle sequencing reagents (Perkin-Elmer, Foster City, CA).
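
The reaction recipe and cycling program above involve routine arithmetic that the following sketch makes explicit. The final concentrations and cycling times come from the text; the stock concentrations, the 10% pipetting overage, and all function names are assumptions for illustration and were not reported in this work.

```python
# Final per-reaction volume from the text.
REACTION_VOLUME_UL = 50.0

# Stock concentrations are assumed for illustration only.
STOCKS = {
    "primer_uM": 10.0,        # each primer stock, µM (assumed)
    "dntp_uM": 10000.0,       # each dNTP in the stock mix, µM (assumed 10 mM)
    "buffer_x": 10.0,         # PCR buffer stock, fold concentration (assumed)
    "taq_units_per_ul": 5.0,  # Taq polymerase stock, units/µl (assumed)
}


def per_reaction_volumes_ul(stocks=STOCKS, volume_ul=REACTION_VOLUME_UL):
    """Volume (µl) of each stock for one reaction, using C1*V1 = C2*V2."""
    return {
        "primer_each": 1.0 * volume_ul / stocks["primer_uM"],  # 1 µM final
        "dntp_mix": 200.0 * volume_ul / stocks["dntp_uM"],     # 200 µM final
        "buffer": 1.0 * volume_ul / stocks["buffer_x"],        # 1x final
        "taq": 2.5 / stocks["taq_units_per_ul"],               # 2.5 units
    }


def master_mix_ul(n_reactions, overage=0.1):
    """Scale per-reaction volumes to n reactions plus a pipetting overage."""
    scale = n_reactions * (1.0 + overage)
    return {name: vol * scale for name, vol in per_reaction_volumes_ul().items()}


def programmed_time_min(cycles=33):
    """Programmed hold time in minutes, ignoring temperature ramping."""
    seconds = 2 * 60 + cycles * (20 + 20 + 90) + 10 * 60
    return seconds / 60.0


print(per_reaction_volumes_ul())
print(programmed_time_min())
```

The single-primer reactions would use one primer volume and the combined reaction two, with water making up the remainder of the 50 µl.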

Results  and  Discussion 

End  sequence  was  obtained  from  19  clones,  15  of  which  contained  simple 
sequence  repeats  (at  least  two  clones  containing  simple  sequence  repeats  were 
identified  in  each  of  the  four  chromosome  14  YACs).  Specific  primers  were 
designed  from  unique  sequences  flanking  the  repeat  units  in  six  clones  to  permit 
specific  amplification  of  these  loci.  All  gave  a single  amplicon  of  expected  size 
upon  PCR  using  the  YAC  clone  from  which  they  had  been  derived.  Testing  DNA 
from  several  mouse  strains  revealed  that  all  six  STSs  were  polymorphic  (Table 
2.1).^^  Genetic  mapping  in  an  intersubspecific  backcross  revealed  that  each  of 
these  polymorphisms  mapped  to  the  same  region  of  mouse  chromosome  14  as 
the  YACs  (see  Chapter  4).  Prior  to  these  experiments  the  four  chromosome  14 
YACs  were  known  to  contain  only  three  SSLPs.  The  experiments  described 
here  have  identified  six  novel  SSLPs  within  three  of  the  four  YACs.  Table  2.1 
lists  the  specific  primer  sequence  for  each  marker  along  with  the  approximate 
amplicon  size  for  each  mouse  strain  tested. 
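
The text does not say how simple sequence repeats were recognized in the end sequences; it may well have been by inspection. As a hedged sketch, a regular expression can flag tandem repeats of 2-5 nucleotide units in a sequence read. The function name `find_ssrs`, the threshold of four copies, and the example insert are all assumptions made for illustration.

```python
import re

def find_ssrs(seq, min_unit=2, max_unit=5, min_repeats=4):
    """Return (start, unit, copies) for each tandem repeat of a 2-5 nt unit.

    The lazy quantifier makes the regex try the shortest unit first, so a
    CA run is reported with unit CA rather than CACA.
    """
    pattern = re.compile(
        r"([ACGT]{%d,%d}?)\1{%d,}" % (min_unit, max_unit, min_repeats - 1)
    )
    hits = []
    for m in pattern.finditer(seq.upper()):
        unit = m.group(1)
        if len(set(unit)) == 1:
            continue  # skip homopolymer runs such as AAAAAAAA
        hits.append((m.start(), unit, len(m.group(0)) // len(unit)))
    return hits

# Hypothetical insert: made-up flanks around a (CA)12 block.
insert = "GGATCCTTGT" + "CA" * 12 + "GTTAGCTAGG"
print(find_ssrs(insert))
```

Primers would then be placed in the unique sequence on either side of each reported repeat, as described above.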

In  order  to  determine  the  general  applicability  of  this  method  across  the 
mouse  genome,  four  YACs  from  another  mouse  autosome,  chromosome  13, 
were  used  as  template  for  B1-B1  IRE  PCR.  Of  the  20  resultant  amplicons  that 
were  cloned  and  end-sequenced,  14  contained  simple  sequence  repeats,  one  of 
which  was  tested  and  found  to  be  polymorphic  (Table  2.1).  The  latter  did  map 
back  to  the  region  of  mouse  chromosome  13  from  which  the  YAC  clones  were 
derived (data not shown). These experiments showed that B1-B1 IRE-PCR is an
efficient  method  for  identification  of  novel  mouse  SSLPs.  It  should  be  noted  that 
some  of  the  remaining  IRE-PCR  products  that  may  not  contain  simple  sequence 
length  polymorphisms  are  still  useful  as  novel  STSs. 


Table  2.1:  Novel  simple  sequence  length  polymorphisms.^^ 


Novel SSLP  Primer Sequence (5' to 3')          Repeat Type         Product Size (bp)
                                                                    B6    CAST  PWK   DBA

D14Sfk1     Forward = GAGAGAAACCCTGTCTCAAAC     CAAA, AGGG, & AG    239   225*  240*  240*
            Reverse = CCATGGTGTATGTGACTTCAG

D14Sfk2     Forward = (>\GGACAGCCAGGGTTATAC     CCAAA               224   250*  225*  ND
            Reverse = CTCAAGAAGATGCCGAATAG

D14Sfk3     Forward = GGACAGCCAGGGTTATACTG      CCAAA               141   160*  140*  150*
            Reverse = ATGTTATGTCACAGAATCAGC

D14Sfk4     Forward = TTGGCCATGCAGGCAAACAC      GT & GA             255   255*  270*  ND
            Reverse = CCTAATGTCTCCATTAACAAA

D14Sfk5     Forward = GCAAGCCmAGMCAAGGC         CA                  109   135*  110*  125*
            Reverse = GGAACTCTCAGGAAAAGTTAC

D14Sfk6     Forward = CTTGGGTAGGTCTTTCTTGC      GAA, GGAA, & GGAT   185   185*  190*  ND
            Reverse = ATGTTATGTAGTAATCAGTGGC

D13Sfk1     Forward = CTCXBAAAAGCCAAAAGAAG      GAA                 148   170*  160*  140*
            Reverse = GGCTTTAGATATGCAGTGAG


The  first  column  lists  the  novel  polymorphic  markers  identified  with  the  described 
method.  The  second  column  lists  the  forward  and  reverse  primers  (5’  ->  3’)  used 
to  amplify  each  polymorphic  marker.  The  third  column  lists  the  di-,  tri-,  tetra-,  or 
penta-nucleotide  repeats  found  in  each  polymorphic  marker.  The  last  column 
lists  the  product  sizes  (in  base  pairs)  of  each  polymorphic  marker  when  using  B6 
(C57BL/6JEi),  CAST  (CAST/Ei),  PWK  (wild-derived  Mus  musculus  musculus), 
and  DBA  (DBA/2J)  mouse  strain  DNA  as  template.  B6  product  sizes  were  based 
upon  DNA  sequence  analysis.  *CAST,  PWK,  and  DBA  product  sizes  are 
estimated,  based  on  agarose  gel  electrophoresis  analysis.  ND  = not  determined. 
MGD  accession  number  J:  46919. 
The apparent association of SSLPs with B1 repetitive elements was
unanticipated. However, the finding that sequences rich in AnCn blocks tend to
flank  B1  repetitive  elements^”  may  provide  an  explanation  of  the  effectiveness  of 
B1-B1  PCR  in  identifying  novel  SSLPs.  The  method  developed  here  is  a 
promising,  simple  approach  to  assist  with  the  generation  of  dense  genetic  and 
physical  maps  surrounding  mouse  loci  of  interest,  such  as  disease  loci.  The 
considerable  homology  shared  between  the  mouse  B1  repetitive  element  and 
the  human  ‘Alu’  repeat  raises  the  possibility  that  this  method  may  also  be 
applicable  to  identifying  human  SSLPs.  The  SSLPs  identified  in  the  gm  region 
were  extremely  valuable  in  narrowing  the  genetic  region,  to  allow  establishment 
of  a better  physical  map  and  subsequent  positional  cloning,  as  described  in  the 
next  few  chapters. 


CHAPTER  3 

ISOLATION OF MOUSE TRANSCRIBED SEQUENCES FROM YAC/BAC
CLONES USING AN INTERSPERSED REPETITIVE ELEMENT (IRE)-PCR
BASED DIRECT cDNA SELECTION APPROACH

Introduction 

Once  the  gm  physical  contig  was  constructed  and  narrowed  using  the 
additional  markers  identified  with  the  method  described  in  chapter  2,  the  isolation 
and  identification  of  transcribed  sequences  within  the  contig,  as  part  of  positional 
cloning,  became  possible. 

One  of  the  most  rate-limiting  steps  in  positional  cloning  has  been  the 
identification  of  transcribed  sequences  (candidate  disease  genes)  within  a 
defined  genomic  region  that  contains  the  locus  of  interest.  Both  direct  cDNA 
selection^^  and  exon  trapping^^  are  well-characterized  methods  that  have  been 
used  in  numerous  positional  cloning  efforts  to  identify  candidate  disease  genes 
from  genomic  clones  that  lie  within  the  region  of  interest.  Of  these  methods, 
direct  cDNA  selection  is  more  widely  applicable  because  of  its  large  target  size, 
sensitive  nature,  tissue  specificity,  and  relative  speed.^^'^^  In  this  technique, 
biotin-labeled,  denatured  genomic  target  (YAC  etc.)  DNA  is  hybridized  in  solution 
to  tissue-specific  denatured  cDNA  to  which  PCR-amplifiable  tags  have  been 
added.  The  annealed  complexes  are  captured  and  washed,  and  bound  cDNAs 
are eluted and PCR-amplified. Following two such rounds of selection, an
~10,000-fold enrichment is achieved.®^ Since its initial description in 1991, this
method has undergone a number of modifications and improvements.^^'^'* A
significant limitation, however, has been the requirement for uncontaminated
genomic template DNA to prevent false positives. Typically, YAC DNA is purified
by preparative pulsed-field gel electrophoresis (PFGE). However, inadequate
resolution of YAC and host yeast chromosomes frequently results in gross
contamination of direct selection clones with yeast rRNA clones. Biotin labeling
of the template, required for fluid-phase selection with magnetic separation, may
also be inefficient, and requires radioactive monitoring during nick translation.

To  circumvent  the  above  limitations,  we  investigated  the  usefulness  of 
interspersed  repetitive  element  (IRE)-PCR  for  the  generation  of  genomic 
template  DNA  (see  Figure  2.2).^®  As  previously  described  (Chapter  2),^^  IRE- 
PCR  is  a very  useful  method  for  selective  amplification  of  mammalian  DNA  from 
an  admixture  with  yeast  or  prokaryotic  DNA,  without  prior  knowledge  of  the 
mammalian  DNA  sequence,  by  PCR  using  outwardly  oriented  primers  specific 
for  mammalian  high  copy  number  repetitive  elements  such  as  B1  and  B2.  The 
130-150  bp  mouse  B1  repetitive  element,  which  is  believed  to  be  homologous  to 
the  human  ‘Alu’  repeat,  is  present  at  a copy  number  of  130,000-180,000  per 
haploid  mouse  genome,^®  and  frequently  is  located  at  intron-exon  junctions 
within  genes.^° 
As described below, IRE-PCR with biotin-labeled repetitive element
primers  (B1-B1,  B2-B2,  B1-B2)  was  exploited  as  a one-step  method  for  isolating 
and  biotin  labeling  selected  mouse  genomic  template  for  direct  cDNA  selection 
(MGD  # J:51591).^®  This  novel  approach  amplifies  and  biotinylates  genomic 
DNA  flanking  B1  and  B2  repetitive  elements  from  YAC/BAC  clones.  The 
amplified  products  are  then  used  as  the  template  in  the  previously  described 
direct  cDNA  selection  technique.^'* 

Materials  and  Methods 

Genomic  DNA  from  four  different  mouse  chromosome  14  YAC  clones  was 
isolated  as  part  of  a positional  cloning  effort  (see  Chapter  4)^®  and  used  as 
template for ten separate biotinylated IRE-PCR reactions. The DNA was
amplified with all possible combinations of 5’ B1 reverse
(5’-CCGAGTGCTGGGATTAAAGG-3’), 3’ B1 forward (5’-GGACACCAGGGCTACAGAGA-3’),
5’ B2 reverse (5’-ACTGAGCCATCTCTCCAG-3’), and 3’ B2 forward
(5’-ACATGGTGGCTCACAACC-3’) repetitive element primers as previously described
(Chapter 2),^^ with the exception that these primers were biotinylated at the
5’ end (Gibco BRL, Gaithersburg, MD). Using reagents and conditions previously
described (Chapter 2),^^ biotinylated amplicons varied in size from 200 to 2500 bp.
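
The ten reactions are not listed individually in the text. One reading that yields exactly ten is every unordered pairing of the four primers, including each primer used on its own. The Python sketch below enumerates that plan; the interpretation and the helper name `reaction_plan` are assumptions, while the primer sequences are those given above.

```python
from itertools import combinations_with_replacement

PRIMERS = {
    "B1 reverse": "CCGAGTGCTGGGATTAAAGG",
    "B1 forward": "GGACACCAGGGCTACAGAGA",
    "B2 reverse": "ACTGAGCCATCTCTCCAG",
    "B2 forward": "ACATGGTGGCTCACAACC",
}

def reaction_plan(primers):
    """Every unordered pairing of primers, including a primer paired with itself."""
    return list(combinations_with_replacement(sorted(primers), 2))

plan = reaction_plan(PRIMERS)
for i, (p, q) in enumerate(plan, 1):
    label = p if p == q else f"{p} + {q}"
    print(f"reaction {i:2d}: {label}")
```

Four single-primer reactions plus six two-primer reactions give the ten reactions per YAC under this assumption.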

Following purification (PCR purification kit, Qiagen, Santa Clarita, CA), the
products from each YAC were pooled and ~100 ng was used as the target
genomic template for direct cDNA selection (Figure 3.1).
Figure 3.1. Schematic representation of the IRE-PCR modified direct cDNA
selection  procedure.^’^^  Top  left  depicts  construction  of  the  linkered  cDNAs. 
Top  right  depicts  generation  of  the  target  genomic  DNA  using  IRE-PCR  with 
biotinylated  B1  and  B2  primers.  Bottom  portion  depicts  the  binding  and  elution 
of  the  selected  cDNA  fragments. 
Random and oligo(dT)-primed mouse bone marrow cDNAs were generated
using 1-5 µg of poly(A)+ RNA (Superscript Choice System, Gibco BRL). Adhering
to Del Mastro and Lovett’s protocol,^'' ~3 µg of random-primed and
oligo(dT)-primed cDNAs underwent linker ligation (separately) to create two
linkered cDNA sets. Linker primers were used to PCR amplify both sets of
linkered cDNAs in 100 µl reactions. The amplified products were then purified
and eluted (Qiagen), yielding 7.5 µg of amplified random linkered cDNA and
7.7 µg of amplified oligo(dT) linkered cDNA.

Direct cDNA selection was then carried out as described by Del Mastro
and Lovett.^'* Briefly, to block the cDNA against yeast DNA and rRNA
contaminants, 1 µg of each purified linkered cDNA (2 µg total) was mixed with 1
µg total yeast DNA and 2 µg Cot-1 DNA (Gibco BRL). The mixture was vacuum dried
to a 10 µl volume, overlaid with oil, denatured for 5 min at 95°C, chilled for
5 min on ice, mixed with 10 µl of 2X hybridization solution, and incubated
overnight at 65°C. Approximately 100 ng (in 15 µl) of the biotinylated target
IRE-PCR products (isolated above from YAC genomic DNA) was overlaid with oil,
denatured for 5 min at 95°C, and chilled for 5 min on ice. At room temperature,
15 µl of 2X hybridization buffer was added along with the 20 µl of blocked cDNA.
The reaction mixture (genomic target + cDNA) was then incubated for 48 hrs at
65°C. Following the Dynabeads protocol (Dynal, Oslo, Norway), cDNA bound to the
target DNA was eluted, neutralized, and purified (Figure 3.1). The yield of eluted
cDNA was 4.5 µg. Using linker primers, 250 ng of the eluted cDNA was PCR
amplified in a 100 µl reaction, purified, and tested for quality (smear) on an
agarose gel. A second round of direct selection enrichment was carried out as
above using 2 µg of the first round PCR amplified cDNA products and additional
biotinylated target IRE-PCR DNA. The resulting second round eluted cDNA
products were PCR amplified with UDG linker primers (Gibco BRL), purified, and
cloned using the UDG pAMP10 vector system (Gibco BRL). End sequence
(using T7 primer) was obtained from 80 clones, 26 of which either matched
known genes or were novel ESTs (Table 3.1, IRE-DS #1).^® All products (from
genes and ESTs) mapped back to the target YACs used in the selection
process. Half of the clones consisted of rRNA.


Table  3.1.  Isolated  direct  cDNA  selection  products  (MGD  # J:51591).®® 


SUMMARY DATA                                 IRE-DS #1    IRE-DS #2

Number of clones sequenced                   80           80
rRNA                                         40           23
B1 & contaminants                            14           9

Unique Chr 14 Genes                          Number of Clones

Cathepsin G (Ctsg)                           5 (2)*       0
Rab geranylgeranyl transferase (Rabggta)     4 (2)*       15 (2)*
Ubiquitin-like protein (Nedd8)               2            21 (3)*
Leukotriene B4 receptor (Ltb4r)              0            1
Multispanning membrane protein (Mmp)         0            5 (2)*
Novel mast cell protease (Mcpt)              1            0
Novel ESTs                                   14           6

( )* = number of different gene regions isolated
To maximize target DNA amplification between repetitive elements and to
reduce rRNA contamination, long-range PCR and additional rRNA blocking
reagents were utilized in a second direct cDNA selection experiment.
Long-range PCR with Klentaq (Advantage cDNA PCR kit, Clontech, Palo Alto, CA)
and a 3 min extension was carried out using the same target YAC genomic DNA,
biotinylated B1 and B2 primers, and conditions as above. Biotinylated target
IRE-PCR products ranged in size from 600 bp to 6 kb. The linkered cDNA sets
were blocked for contaminants as above, except that mouse rDNA (gift from
Dr. Anand Srivastava) was also included as a blocking reagent. The blocked
cDNA and target long-range IRE-PCR DNA were mixed, eluted, and cloned as
above, except that a different Dynabead kit (Dynal, kilobase BINDER) was
employed to bind the beads efficiently to the larger DNA fragments. End
sequence showed that 48/80 clones matched known genes or were novel ESTs
(Table 3.1, IRE-DS #2).^® All 48 clones mapped back to the original target YACs.
As shown in Table 3.1, these minor adjustments to our first direct cDNA selection
procedure reduced contaminants (rRNA, B1) from 54 to 32 of 80 clones, 59% of
the earlier number, and yielded two additional genes.
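
The comparison between the two selection experiments can be checked directly from the clone counts in Table 3.1. The Python sketch below tallies contaminant and informative clones per experiment; the dictionary layout and the helper name `summarize` are ours, and only the counts come from the table.

```python
# Clone counts transcribed from Table 3.1.
TABLE_3_1 = {
    "IRE-DS #1": {"sequenced": 80, "rRNA": 40, "B1 & contaminants": 14},
    "IRE-DS #2": {"sequenced": 80, "rRNA": 23, "B1 & contaminants": 9},
}

def summarize(row):
    """Return (contaminant clones, informative clones) for one experiment."""
    contaminants = row["rRNA"] + row["B1 & contaminants"]
    informative = row["sequenced"] - contaminants
    return contaminants, informative

c1, i1 = summarize(TABLE_3_1["IRE-DS #1"])
c2, i2 = summarize(TABLE_3_1["IRE-DS #2"])
ratio = c2 / c1
print(f"IRE-DS #1: {c1} contaminant, {i1} informative clones")
print(f"IRE-DS #2: {c2} contaminant, {i2} informative clones")
print(f"contaminants in #2 relative to #1: {ratio:.0%} (a {1 - ratio:.0%} reduction)")
```

The informative counts (26 and 48) match the numbers of gene and EST clones reported for the two experiments, and the contaminant count in the second experiment is about 59% of that in the first.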

Results  and  Discussion 

Advantages  to  this  IRE-PCR  modification  of  direct  cDNA  selection 
include: (1) Purification of YAC DNA by PFGE is not required since yeast and E.
coli do not contain B1 or B2 repetitive elements; (2) Use of 5’-biotin labeled
B1/B2 primers ensures efficient generation of biotinylated target PCR fragments;
(3) False positive clones (rRNA and B1 contaminants) are reduced in frequency;
and  (4)  Isolation  of  novel  STSs  flanking  B1/B2  sequences  may  be  useful  for 
saturating  a genomic  region  with  novel  genetic  markers  (we  have  recently 
reported  that  these  regions  are  enriched  for  polymorphic  simple  sequence 
repeats,  Detter  et  al.,  1998).  Since  B1  and  B2  repetitive  elements  are  abundant 
within  the  mouse  genome,  this  approach  results  in  relatively  little  loss  of  template 
complexity  and  is  widely  applicable.  Genomic  coverage  is  optimized  by  long- 
range  PCR  techniques. 

Using  the  above  IRE-PCR  based  direct  cDNA  selection  approach  (Figure 
3.1),  we  identified  five  known  genes  and  twenty-one  novel  ESTs  on  four  YAC 
clones  that  span  the  critical  region  of  our  mouse  mutation  (Table  3.1).^®  Since 
four  of  the  five  genes  identified  were  represented  among  different  direct 
selection  clones,  this  indicates  that  B1  and/or  B2  repetitive  elements  are  often 
interspersed  throughout  a gene,  further  illustrating  the  utility  of  this  method.  This 
powerful  method  is  useful  not  only  for  positional  cloning  projects,  but  also  for 
rapidly  generating  regional  transcription  unit  maps.  It  is  feasible  that  this  method 
would  also  be  applicable  to  other  mammalian  species  by  utilizing  other  high- 
copy,  species-specific  repetitive  elements.  As  seen  in  chapter  4,  we  recently 
determined  that  one  of  the  genes  isolated  (Rabggta)  was  the  causative  gene  in 
our  mouse  mutant.^® 


CHAPTER  4 

RAB GERANYLGERANYL TRANSFERASE MUTATION IN THE GUNMETAL
MOUSE CAUSES DEFICIENT RAB27a PRENYLATION AND PLATELET
SYNTHESIS


Introduction 

As  described  in  the  preceding  chapters,  my  work  to  this  point  led  to 
identification  of  more  markers  in  the  gm  region,  and  isolation  of  candidate  cDNAs 
from  the  critical  region  YAC  and  BAC  clones.^^'^®  These  two  accomplishments 
were  important  steps  in  this  project  as  prerequisites  for  the  final  stages:  further 
narrowing  of  the  gm  interval,  and  selection  and  analysis  of  candidate  cDNAs,  to 
accomplish  the  positional  cloning  of  the  gm  gene. 

This  chapter  first  describes  the  refined  genetic  mapping  with  the  new 
markers,  which  narrowed  the  interval  from  1.5  cM  to  0.64  cM,  and  further 
describes  a contig  of  genomic  clones  fully  spanning  this  region.^®  The  additional 
mapping  data  were  useful  in  excluding  some  of  the  candidate  cDNAs  described 
in  chapter  3.  Among  those  cDNAs  still  within  the  gm  interval,  a portion  were 
excluded  from  gm  candidacy  based  on  tissue  expression  patterns.  The  Rab 
geranylgeranyl  transferase,  alpha  subunit  gene  (Rabggta)  was  the  top  remaining 
candidate  based  on  the  protein’s  known  function,  prenylation  of  Rab  GTPases. 
Briefly, Rabggta is one subunit of the Rab GGTase heterodimer, which functions
to attach 20-carbon geranylgeranyl groups to small Rab proteins (Rab GTPases)
(see Figure 5.1).^^’^® This modification enables the Rab GTPases to bind to their
target membranes and carry out their prescribed vesicular trafficking functions
(including membrane budding and fusion reactions).®® It made intuitive sense
that loss or alteration of the alpha subunit could lead to gm features, because we
would expect this to cause a lack of prenylated Rab GTPases, which in turn
would reduce the amount of membrane budding and fusion necessary to
package α-granules, δ-granules, and/or platelets. Furthermore, the tissue
distribution and levels of Rabggta expression are consistent with the tissues most
affected in gm mice. Based on these factors, this gene was chosen for further
analysis in gm mice, and this chapter describes the genetic and biochemical
analyses that revealed Rabggta to be the gm gene.®®
Materials and Methods

Mice  and  Genetic  Mapping 

Mice  obtained  from  the  Jackson  Laboratory  (Bar  Harbor,  ME)  were  bred  at 
Roswell  Park  Cancer  Institute  through  our  major  collaborator,  Dr.  Richard 
Swank. Backcross mice were phenotyped by coat color (as gm/gm [gray] or
gm/+ [black]) and DNAs were then shipped to Florida, where I genotyped them for
SSLPs and RFLPs using standard techniques.^® Linkage relationships were
determined  by  segregation  analysis,  and  best  locus  order  was  decided  by 
minimizing  crossovers  and  eliminating  double  crossovers.^^ 
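
For closely linked loci in a backcross, the map distance is commonly approximated by the percentage of recombinant progeny. The Python sketch below shows that arithmetic with a binomial standard error; it is not the procedure used to build the consensus map in this work, which combined three crosses and may have been computed differently. The function name is ours, and treating percent recombination as cM without a mapping-function correction is an assumption that is reasonable only for short intervals. The example counts are those reported in the Figure 4.1 legend for the DBA backcross.

```python
import math

def map_distance_cm(recombinants, meioses):
    """Percent recombination (treated as cM) and its binomial standard error."""
    if meioses <= 0 or not 0 <= recombinants <= meioses:
        raise ValueError("need meioses > 0 and 0 <= recombinants <= meioses")
    r = recombinants / meioses
    se = math.sqrt(r * (1 - r) / meioses)
    return 100 * r, 100 * se

# DBA backcross, gm to D14Mit122: six crossovers among 966 progeny.
d, se = map_distance_cm(6, 966)
print(f"{d:.2f} cM (SE {se:.2f} cM)")
```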

Physical  Mapping 

Mouse  YAC  and  BAC  clones  were  identified  from  libraries^®  (Research  Genetics, 
Huntsville,  AL)  by  PCR  or  hybridization  with  oligonucleotides  corresponding  to 
loci  in  the  vicinity  of  gm.  Clones  were  oriented  by  sequence  tagged  site  (STS) 
content mapping''® and sequencing of clone insert ends. Additional markers
(SSLPs and genes) were isolated from YAC and BAC clones by IRE-PCR (see
Chapter 2) and direct cDNA selection (see Chapter 3),®® and then placed on the
contig based on their location relative to other markers (see Figure 4.3). BACs
were initially isolated with the assistance of Dr. Swank’s lab.

IRE-Direct  cDNA  Selection 
(see  Chapter  3)®® 
Full-length Rabggta cDNA Isolation

5’- and 3’-RACE were performed with mouse fetal brain cDNA (Clontech, Palo
Alto, CA). 5’-RACE PCR used a Rabggta-specific reverse primer
(5’-TTCACACGCAGACAGCTCTC-3’) and universal forward primer, followed by a
second, nested PCR reaction with the Rabggta primer
(5’-CTCTAGCCTTTTGGCCTCTG-3’) and second universal primer. 3’-RACE was
performed using a Rabggta-specific forward primer
(5’-CCAGCAGTCTGCTGCACTTC-3’) and universal reverse primer, and a second,
nested PCR used the Rabggta primer (5’-GGAGCGTTTGGCTGAGATGC-3’) and another
universal reverse primer. Products were subcloned and sequenced, and the
full-length Rabggta cDNA sequence was assembled.

RT-PCR  Analysis 

Total  RNA  was  prepared  from  +/+,  gm/+,  and  gm/gm  tissues  (bone  marrow, 
kidney,  and  melanocyte)  with  equilibrated  phenol/guanidine  isothiocyanate 
(Gibco  BRL,  Gaithersburg,  MD).  Oligo(dT)-primed,  first-strand  cDNA  was 
synthesized  with  Superscript  II  reverse  transcriptase  (Gibco  BRL).  All  Rabggta 
exons  were  amplified  by  PCR  with  gene  specific  primers.  Aberrant  gm  splicing 
was  identified  with  the  Rabggta  primers  5’-GTGCAAGGGTCCACGGGAC-3’ 
(exon α) and 5’-TTCACACGCAGACAGCTCTC-3’ (exon 4).
Western-blot Analysis

Platelet and bone marrow cells were isolated as described''^ except that platelets
were washed twice with 0.38% sodium citrate in 0.85% NaCl and once with 1%
ammonium oxalate to remove residual red blood cells. Protein (20 µg) from
gm/gm, gm/+, and C57BL/6J-+/+ tissues in proteinase inhibitor cocktail was
incubated in SDS/mercaptoethanol and electrophoresed on denaturing 10%
polyacrylamide gels followed by Western blotting on nitrocellulose or PVDF
membranes.^® For subcellular fractionation studies, fresh platelets were
sonicated  and  separated  into  membrane  and  soluble  subcellular  fractions  by 
centrifugation  as  described;^®  samples  were  immediately  separated  on  12% 
polyacrylamide  gels.  Western  blots  were  probed  with  specific  antisera  against 
Rabggta  (C222)®^  or  Rab27a'‘^,  followed  by  peroxidase-labeled  goat  anti-rabbit 
IgG  secondary  antibody  (Kirkegaard  & Perry,  Gaithersburg,  MD),  and  visualized 
by  incubation  with  ECL-Plus  luminescent  reagent  (Amersham,  Piscataway,  NJ) 
and  autoradiography.'*®  Several  exposures  were  taken  and  comparisons  were 
made  in  the  linear  range.  Blots  were  washed  and  re-probed  with  antibodies  to 
mouse actin. Alternatively, Western blots were hybridized with [32P-GTP] and
washed,  followed  by  autoradiography,  to  detect  GTP-binding  proteins.*® 
Quantitations  were  obtained  using  a phosphorimager  (Molecular  Dynamics, 
Sunnyvale,  CA).  Western  analysis  was  accomplished  by  Dr.  Richard  Swank’s 
laboratory  at  Roswell  Park  Cancer  Institute,  Buffalo,  NY. 
Assay of Rab GGTransferase Activity

Frozen tissues (liver, kidney, or spleen) were thawed, immersed in 3 volumes of
buffer (50 mM sodium HEPES (pH 7.2), 5 mM MgCl2, 10 mM NaCl, 1 mM
dithiothreitol, 0.5 mM phenylmethylsulfonyl fluoride, 5 µg/ml aprotinin, 5 µg/ml
pepstatin, and 5 µg/ml leupeptin) and lysed by polytron homogenization.

Lysates were spun at 100,000 x g for 1 hr, and pellets discarded. Frozen platelet
samples (10^-10® platelets/pellet) were lysed in buffer by repeated passages
through a Hamilton syringe. Protein concentrations were determined using
Coomassie Plus reagent (Pierce).

Rab GGTase activity'*^ was determined by measuring [3H] transfer from
[3H]geranylgeranyl pyrophosphate to Rab1a in 50 µl reactions containing 50 mM
sodium HEPES (pH 7.2), 5 mM MgCl2, 1 mM dithiothreitol, 4 µM [3H]GGPP
(2,365 dpm/pmol), 5 µM recombinant Rab1a, 2 µM recombinant REP-1, and
indicated amounts of cytosolic protein from gm/gm or +/+ tissues. After
incubation for 15 min at 37°C, ethanol/HCl-precipitable radioactivity was
measured on glass fiber filters. Assays with concentrations of cytosolic proteins
<10 µg were carried out in 25 µl reactions containing 1 µM [3H]GGPP (47,300
dpm/pmol) and incubated for 30 min at 37°C. This transferase activity assay was
carried out by Dr. Miguel Seabra’s laboratory (collaborator and developer of
method) at Imperial College, London, UK.
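
Filter counts from an assay of this kind are usually converted to picomoles of geranylgeranyl transferred by dividing by the specific activity of the labeled substrate. The Python sketch below shows that conversion together with the amount of [3H]GGPP present per reaction. The function names, the background-subtraction argument, and the example count of 4,730 dpm are hypothetical; only the concentrations, volumes, and specific activities are taken from the text.

```python
def substrate_pmol(conc_um, volume_ul):
    """Amount of substrate in a reaction: µM x µl = pmol."""
    return conc_um * volume_ul

def pmol_transferred(dpm, specific_activity_dpm_per_pmol, background_dpm=0.0):
    """Convert precipitable counts to pmol of geranylgeranyl incorporated."""
    net = max(dpm - background_dpm, 0.0)
    return net / specific_activity_dpm_per_pmol

# Standard assay: 4 µM [3H]GGPP in 50 µl; low-protein assay: 1 µM in 25 µl.
print(substrate_pmol(4, 50), "pmol GGPP per standard reaction")
print(substrate_pmol(1, 25), "pmol GGPP per low-protein reaction")

# Hypothetical filter count at the stated 2,365 dpm/pmol.
print(pmol_transferred(4730, 2365), "pmol transferred")
```

Dividing the result by the amount of cytosolic protein and the incubation time would give a specific activity; the text does not state which normalization was reported.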
Assay of GGTase-I Activity

GGTase-I activity'*^ was determined by Dr. Seabra’s lab by measuring [3H]
transfer from [3H]geranylgeranyl pyrophosphate to Rac1 in a 25 µl reaction
containing 50 mM sodium HEPES (pH 7.2), 150 mM KCl, 5 mM MgCl2, 1 mM
dithiothreitol, 1.2 µM [3H]GGPP (16,133 dpm/pmol), 200 µM Zwittergent 3-14, 25
µM recombinant Rac1, and the indicated amounts of cytosolic proteins. After
incubation for 30 min at 37°C, ethanol/HCl-precipitable radioactivity was
measured as above.
Results and Discussion

Genotyping of 29 loci in 3 gm backcrosses, representing 2462 meioses,
narrowed the gm critical region to 0.64 cM between D14Mit63 and D14Mit122
(Figures 4.1 and 4.2, Mouse Genome Database [MGD] accession J:52722).^®
Three crosses were utilized to minimize the effect of a recombination cold spot
observed in this region in the [(C57BL/6J-gm x SPRET)F1 X C57BL/6J-gm]
backcross. A consensus genetic map was constructed using the three
backcrosses to allow for physical contig construction (Figure 4.3a). Genetic
markers flanking and within the gm critical region were used to isolate
contiguous yeast artificial chromosome (YAC) and bacterial artificial
chromosome (BAC) clones encompassing gm (Figure 4.3b). To complete the
cloning of this interval, novel, polymorphic simple sequence repeats (D14Sfk1,
D14Sfk3, and D14Sfk5; Figure 4.3a, MGD J:46919) and sequence tagged sites
(STS) were isolated from critical region YACs by interspersed repetitive element
PCR (Chapter 2).^^ They were mapped (Figure 4.3a,b) and used to isolate
additional YACs and BACs until a tiling path of clones covered the interval
(Figure 4.3b). A family of serine protease genes (Mcpt1, Mcpt2, Mcpt4, Mcpt5,
Mcpt9, McptL, Ctla1, Ctla5, Ctla6, Ctla7, and Ctsg) were mapped within the gm
critical region but were excluded from gm candidacy by virtue of incompatible
distribution of expression or absence of mutations (data not shown).


Figure 4.1: Mouse chromosome 14 haplotype analysis of three gm backcrosses.
Black boxes represent homozygous gunmetal (gm/gm) mice based on coat
color. White boxes represent heterozygous gunmetal (SPRET, PWK, or
DBA/gm) mice based on coat color. Genetic markers are to the left of the boxes
and are arranged proximal (top) to distal (bottom) based on location. Numbers
below the boxes represent the number of mice that coincide with the crossover
pattern depicted by the box color and the genetic markers. a, Haplotype of 769
progeny produced by the [(C57BL/6J-gm x SPRET)F1 X C57BL/6J-gm]
backcross. Two mice have a crossover between D14Mit63 and gm (etc.), and
one mouse has a crossover between gm (etc.) and D14Mit5. No crossovers were
observed between D14Mit63 and D14Mit5 in 766 progeny. b, Haplotype of 727
progeny produced by the [(C57BL/6J-gm x PWK)F1 X C57BL/6J-gm] backcross.
Six mice have a crossover between Myhca and gm (etc.), and four mice have a
crossover between gm (etc.) and D14Mit122. No crossovers were observed
between Myhca and D14Mit122 in 717 progeny. c, Haplotype of 966 progeny
produced by the [(C57BL/6J-gm x DBA)F1 X C57BL/6J-gm] backcross. Two
mice have a crossover between D14Mit63 and gm (etc.), and six mice have a
crossover between gm (etc.) and D14Mit122 (etc.). No crossovers were
observed between D14Mit63 and D14Mit122 (etc.) in 958 progeny. gm and
Rabggta cosegregate with each other in all three crosses.
Figure 4.2: Mouse chromosome 14 genetic ideograms of the three gm
backcrosses.  Data  in  Figure  4.1  were  used  in  the  construction  of  these  genetic 
maps.  The  genetic  markers  in  the  vicinity  of  gm  are  shown  to  the  right  of  each 
stick  figure.  The  markers  are  listed  from  proximal  (top)  to  distal  (bottom).  The 
marker  numbers  correspond  to  D14Mit  markers.  The  centiMorgan  (cM) 
distances  between  the  genetic  markers  are  shown  to  the  left  of  each  ideogram. 
Dashed  lines  connect  conserved  genetic  regions  (same  markers)  between  each 
backcross.  gm  and  Rabggta  cosegregate  in  all  three  crosses. 
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Figure  4.3:  Genetic  and  physical  maps  of  mouse  Chr  14  in  the  vicinity  of  gm.^ 
a,  Consensus  genetic  map  of  the  gm  region  on  Chr  14  derived  from  3 
backcrosses  (MGD  J:52722).  The  circle  (left)  represents  the  centromere  of  Chr 
14.  Distances  between  markers  are  in  cM  below  the  chromosome,  while  loci  are 
listed  above.  Numbered  loci  represent  D14Mit  microsatellite  markers.  Relative 
positions  of  loci  were  ascertained  from  769  [(C57BL/6J-gm  x SPRET)Fi  X 
C57BL/6J-gm],  727  [(C57BL/6J-gm  x PWK)Fi  X C57BL/6J-gm],  and  966 
[(C57BL/6J-gm  x DBA)Fi  X C57BL/6J-gm]  backcross  mice.  No  recombinations 
were  found  between  gm  and  Rabggta  in  the  3 backcrosses  (2462  meioses). 
Dotted  lines  link  the  consensus  gm  genetic  map  to  the  physical  map  using  the 
loci  D14Mit63  and  D14Mit122.  b,  Physical  map  of  the  gm  non-recombinant 
interval  (MGD  J:52722).  The  gm  critical  region  is  delineated  by  chromosome 
crossovers  (denoted  by  red  X).  gm  was  flanked  by  D14Mit63  (proximal),  and 
D14MH153  and  D14MH122  (distal).  YAC  (blue  lines)  and  BAC  (red  lines)  clones 
are  shown  below  the  chromosome.  Loci  physically  mapped  within  the  YAC  and 
BAC  clones  are  indicated  with  colored  circles.  The  location  of  Rabggta  is 
indicated  by  red  circles.  Green  circles  indicate  genes  near  gm  that,  c,  also  map 
to the homologous human chromosomal segment (14q11.2).
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Additional  candidate  genes  were  identified  from  pooled  gm  contig  YACs 
and  BACs  using  the  modified  direct  cDNA  selection  method  described  in 
chapter 3 (MGD # J:51591). Nineteen of 160 selected cDNA fragments were
derived from a single candidate, Rabggta. Because of the logical connection
between Rabggta function and the gm phenotype, further work focused on this
gene as the leading gm candidate. Rabggta-derived STSs were
physically mapped on 6 gm critical region YAC/BAC clones (Figure 4.3b; MGD
# J:52722). A 2.2 kb Mus spretus BamHI RFLP enabled genetic mapping of
Rabggta in the [(C57BL/6J-gm × SPRET)F1 × C57BL/6J-gm] backcross.
Rabggta cosegregated with gm (Figure 4.3a, MGD # J:52722), confirming
derivation from the gm critical region.

Since  mouse  Rabggta  was  novel,  overlapping  cDNAs  encompassing  the 
full-length  coding  domain  were  isolated  and  sequenced  (GenBank  accession 
AF127656, see Appendix A). Mouse Rabggta cDNA shared 94% and 86%
nucleotide identity with rat and human Rabggta, respectively. Rabggta is highly
conserved, as evidenced by 33% predicted amino acid identity between mouse
Rabggta and its S. cerevisiae homologue, BET4 (see Appendix B).
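The identity figures quoted above are ratios of matching positions to compared positions in a pairwise alignment. As a minimal sketch of that calculation, the function below scores a pre-aligned pair of sequences. The choice to skip gapped columns is an assumption (alignment programs differ on the denominator), and the two toy sequences are hypothetical, not Rabggta sequence.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over aligned columns in which neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == gap or y == gap:
            continue  # assumption: gapped columns are excluded from the denominator
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Hypothetical toy alignment (illustration only, not real Rabggta sequence).
toy_mouse = "ATGGCC-TA"
toy_rat = "ATGACCGTA"
toy_identity = percent_identity(toy_mouse, toy_rat)
print(f"{toy_identity:.1f}% identity")
```

Applied to real alignments, the same ratio would be expected to reproduce values of the kind reported here, provided the gap convention matches that of the alignment software used.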

Analysis  of  the  genomic  sequence  of  Rabggta  revealed  a single  difference 
between C57BL/6J-+/+ and C57BL/6J-gm/gm DNA (GenBank AF127654 and
AF127655, respectively): the normal 3’ terminal nucleotide of intron a (guanine)
was replaced with an adenine in gm DNA (Figure 4.4a & c). This substitution
created a novel Tsp509I restriction site that was used to show perfect
cosegregation of Rabggta and gm in 2462 meioses (MGD # J:52722).
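To illustrate how a single G→A change can create a restriction site usable for genotyping, the sketch below scans a sequence for the Tsp509I recognition sequence (AATT). The flanking sequences are hypothetical and were chosen only so that the substitution creates the site. They are not the actual Rabggta intron a / exon 1 sequence, which is given in Figure 4.4a and the GenBank entries cited above.

```python
TSP509I_SITE = "AATT"  # Tsp509I recognition sequence


def site_positions(seq, site=TSP509I_SITE):
    """Return 0-based start positions of every occurrence of `site` in `seq`."""
    seq = seq.upper()
    return [i for i in range(len(seq) - len(site) + 1) if seq[i:i + len(site)] == site]


# Hypothetical sequences: lowercase = 3' end of an intron, uppercase = start of an exon.
# Wild type ends the intron with the consensus "ag"; the mutant carries "aa" instead.
wild_type = "ctctcccacag" + "TTGCCATGC"
mutant = "ctctcccacaa" + "TTGCCATGC"

print("wild type sites:", site_positions(wild_type))
print("mutant sites:   ", site_positions(mutant))
```

In this toy case only the mutant sequence contains the site, so digestion of a PCR product spanning the junction would distinguish the alleles by fragment size.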
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Figure  4.4:  A splice  acceptor  mutation  in  Rabggta  causes  abnormal  splicing.^ 
a,  genomic  structure  of  the  5’  end  of  mouse  Rabggta.  Exon  a encodes  5’ 
untranslated  region.  The  initiation  methionine  (bold  and  underlined  ATG)  occurs 
at the 3’ end of exon 1. Rabggta nucleotide sequence above is C57BL/6J-+/+,
and below is C57BL/6J-gm/gm. The splice acceptor mutation is indicated by an
asterisk. Rabggta exons connected by solid green lines represent normal
splicing. Dashed red lines indicate exon 1 skipping in gm/gm RNA, and dotted
blue lines indicate cryptic splice sites utilized by gm/gm. b, RT-PCR analysis of
mRNA from wild-type (+/+) and mutant (gm/gm) bone marrow. The Rabggta
oligonucleotides used amplified the region between exon a and exon 2. Normal
bone marrow (+/+) shows the expected product of 486 bp (band A) (GenBank
AF127656), and a product (638 bp) retaining intron a (band B) (GenBank
AF127658). gm/gm bone marrow shows 3 bands: Band C (429 bp) (GenBank
AF127657) represents exon 1 skipping; Band B’ (638 bp) (GenBank AF127659)
retains intron a; Band D (557 bp) (GenBank AF127662) represents utilization of
cryptic splice sites in intron a and exon 1. c, Sequence analysis of Rabggta
genomic DNA. Arrows indicate the splice acceptor mutation: +/+ = G, gm/gm =
A, and gm/+ exhibits both nucleotides (G and A).




Typically,  substitution  of  the  obligatory  3’-terminal  guanine  of  the 
consensus  splice  acceptor  site  prevents  normal  mRNA  splicing,  and  leads  to 
intron retention, exon skipping, or cryptic splice site utilization. Reverse
transcription and PCR (RT-PCR) of the 5’ end of Rabggta RNA (corresponding
to exons a, 1, and 2) disclosed aberrantly spliced Rabggta transcripts in gm
bone marrow (Figure 4.4b). This tissue was used because the hematologic
abnormalities of gm homozygotes are corrected by bone marrow
transplantation. In normal mice, RT-PCR of the 5’ end of Rabggta RNA gave 2
products.  The  most  abundant  product  was  correctly  spliced  (exon  a - exon  1 - 
exon  2;  Figure  4.4b,  +/+  product  A).  A second,  minor  product  observed  in 
normal  mice  retained  intron  a (exon  a - intron  a - exon  1 - exon  2;  Figure  4.4b, 
+/+  product  B)  and  may  be  immature,  since  it  could  only  be  marginally  amplified 
by  PCR  from  cytosolic  or  poly(A)+  RNA  (data  not  shown). 

In  gm  homozygotes,  the  major  Rabggta  transcript  of  +/+  bone  marrow 
was  not  observed  (product  A,  exon  a - exon  1 - exon  2).  Instead,  the  most 
abundant  gm  Rabggta  product  was  novel  and  skipped  exon  1 (Figure  4.4b, 
gm/gm  product  C;  exon  a - exon  2).  Translation  of  this  transcript  was  not 
anticipated  to  yield  functional  Rabggta  protein  since  the  initiation  codon  was  lost, 
and  no  alternative,  in-frame  start  codon  was  present  (Figure  4.4b). 

A second  novel,  but  much  less  abundant,  gm  Rabggta  product 
represented activation of a cryptic splice donor site (within intron a, 84 bp
downstream of the normal splice donor site) and a cryptic acceptor site (within
exon 1, 13 bp downstream of the G→A substitution) (Figure 4.4b, gm/gm product D).
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Importantly,  utilization  of  these  cryptic  splice  sites  did  not  result  in  removal  of  the 
initiation  codon.  The  existence  of  this  less  abundant  transcript  is  a logical 
explanation  for  the  presence  of  Rabggta  in  gm  platelets,  at  a reduced  level  (see 
Figure  4.5). 
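The RT-PCR product sizes reported in Figure 4.4b can be cross-checked against the splicing interpretation with simple arithmetic. The sketch below takes the four band sizes from the figure legend and assumes that the cryptic donor adds exactly 84 bp of intron a and that the cryptic acceptor removes exactly the first 13 bp of exon 1. The variable names are illustrative only.

```python
# Band sizes (bp) from the Figure 4.4b legend.
PRODUCT_A = 486  # exon a - exon 1 - exon 2 (normal splicing)
PRODUCT_B = 638  # exon a - intron a - exon 1 - exon 2 (intron a retained)
PRODUCT_C = 429  # exon a - exon 2 (exon 1 skipped)
PRODUCT_D = 557  # cryptic donor in intron a, cryptic acceptor in exon 1

# Lengths implied by the size differences between bands.
exon1_len = PRODUCT_A - PRODUCT_C     # sequence lost when exon 1 is skipped
intron_a_len = PRODUCT_B - PRODUCT_A  # sequence gained when intron a is retained

# Assumption: cryptic donor retains 84 bp of intron a; cryptic acceptor drops
# the first 13 bp of exon 1.
INTRON_BP_RETAINED = 84
EXON1_BP_REMOVED = 13
predicted_d = PRODUCT_A + INTRON_BP_RETAINED - EXON1_BP_REMOVED

print("exon 1 length (bp):", exon1_len)
print("intron a length (bp):", intron_a_len)
print("predicted product D (bp):", predicted_d, "observed:", PRODUCT_D)
```

Under these assumptions the predicted size of product D agrees with the observed 557 bp band. Because the initiation codon lies at the 3’ end of exon 1, removing 13 bp from the 5’ end of that exon leaves it intact.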

As in +/+ bone marrow, a rare product that retained intron a was amplified
from gm/gm RNA (exon a - intron a - exon 1 - exon 2; Figure 4.4b, product B’).
This gm/gm transcript was also anticipated to be functional if polyadenylated,
since the initiation codon of Rabggta is in exon 1, and therefore the retention of
intron a (with the G→A substitution) should not disrupt translation. However,
RT-PCR  of  cytosolic  and  poly(A)+  RNA  suggested  that  it  was  an  immature,  non- 
polyadenylated  RNA,  like  the  corresponding  transcript  in  +/+  bone  marrow  (data 
not  shown). 

These  aberrant  Rabggta  splice  variants  were  found  in  all  tissues  tested 
(gm bone marrow, melanocyte, and kidney; data not shown). As expected, both
wild  type  and  mutant  products  were  observed  in  gm/+  heterozygote  RNA  (data 
not  shown).  No  other  defect  in  splicing  was  observed;  other  regions  of  Rabggta 
mRNA  produced  RT-PCR  products  of  expected  size  and  sequence  in  +/+  and 
gm/gm,  with  the  exception  of  a Rabggta  isoform  that  retained  intron  9,  which  was 
found in both +/+ and gm/gm mice (GenBank accession AF127660, AF127661).
The  reduction  in  Rabggta  transcript  levels  in  gm  homozygotes  was  specific: 
Northern  blots  and  quantitative  PCR  revealed  no  changes  in  the  abundance  of 
transcripts of other Rab geranylgeranyl transferase subunits (Rab geranylgeranyl
transferase subunit β, Rab escort protein-1 or -2) in bone marrow or kidney of
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gm  homozygotes  (data  not  shown).  While  absolute  determination  of  the  relative 
amounts  of  Rabggta  mRNA  isoforms  in  gm/gm  and  +/+  tissues  was  not  possible 
(since  the  size  differences  could  not  be  resolved  on  Northern  blots),  no 
difference  was  observed  between  gm/gm  and  +/+  tissues  in  the  overall 
abundance of Rabggta mRNA (data not shown). Thus, the identification of a
gm-specific mutation in Rabggta was strong evidence that this was the gm
gene. However, biochemical analysis was also undertaken to show that this
gene was responsible for the gm phenotype.

Western blots of gm platelets showed that the apparent consequence of
aberrant Rabggta mRNA splicing was a ~70% reduction in 60-kDa Rabggta protein
(Figure 4.5). It is likely that residual Rabggta protein in gm/gm
platelets  resulted  from  translation  of  the  novel  gm  Rabggta  transcript  that  utilized 
cryptic  splice  sites  as  described  above  (Figure  4.4b,  gm/gm  product  D). 


Figure 4.5: Decreased Rabggta protein levels in gm/gm mice. Western blots of
wild-type (+/+), heterozygous (gm/+), and mutant (gm/gm) platelets probed with
Rabggta and β-actin antibodies show the 60-kDa Rabggta signal reduced ~70%
in  gm/gm,  as  compared  to  +/+.  Platelets  from  two  different  mice  from  each 
genotype were analyzed for consistency.
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Rabggta is part of Rab GGTase, a heterotrimeric protein comprising α-
and β-subunits, and Rab escort protein. Rab GGTase activity in gm/gm and
+/+  platelets  was  examined  with  an  in  vitro  prenylation  assay  (Figure  4.6a). 
gm/gm  Rab  GGTase  activity  was  reduced  ~70%  compared  with  +/+  platelets 
(Figure  4.6a).  Similar  results  were  obtained  with  liver,  kidney,  and  spleen 
extracts  (Figure  4.6a  and  data  not  shown).  It  is  unlikely  that  these  results 
reflected prenylation of Rab1a by another enzyme (such as GGTase-I or
farnesyltransferase) since other enzymes do not prenylate this substrate. In
addition,  the  decrease  in  Rab  GGTase  activity  matched  the  reduction  in  Rabggta 
protein.  The  gm  reduction  in  Rab  GGTase  was  specific:  no  difference  was 
observed between gm/gm and +/+ in activity of GGTase-I, a related enzyme
(Figure 4.6b). Thus, the Rabggta splicing defect in gm results in a biochemical
defect: a partial decrease in Rabggta levels and Rab GGTase activity. It is
interesting that while all gm tissues showed decreased Rab GGTase activity,
phenotypic abnormalities were observed only in gm platelets and bone marrow.
The sensitivity of platelets to Rab GGTase deficiency could be related to lack of
a nucleus, and therefore decreased capacity for de novo protein synthesis;
alternatively, for reasons unknown, platelets may require more Rab GGTase for
normal function than other tissues.
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Figure 4.6: Decreased Rab GGTase activity in gm/gm mice. a, Rab GGTase
activity of gm/gm and +/+ platelets shows ~70% reduction in gm/gm platelets.
Duplicate cytosolic protein samples from pooled platelets (3 mice each) from
gunmetal (gm/gm, open symbols) and C57BL/6J mice (+/+, solid symbols) were
assayed for Rab GGTase activity by transfer of isoprenoid from
³H-geranylgeranyl-pyrophosphate to Rab1a. b, GGTase-I activity of gm/gm and +/+
pooled platelets shows no difference between the two. GGTase-I activity was
determined by transfer of isoprenoid from ³H-geranylgeranyl-pyrophosphate to
Rad. Each point represents an average of duplicate reactions. For Rab
GGTase reactions, a control reaction incubated following addition of 50 mM
EDTA was subtracted from all samples. For GGTase-I reactions, a control
reaction incubated with buffer alone was subtracted.


A previous  study  showed  that  expression  of  some  small  GTP-binding 
proteins was abnormal in gm platelets. Novel bands (~25- and ~28.5-kD) of
unknown identity were observed on GTP overlays of Western blots of gm
platelets (Figures 1.3 and 4.8). These were not expressed in gm liver, kidney,
brain, spleen, macrophages or neutrophils. Rab proteins represented




candidates for these abnormal GTP binding proteins. Specifically, Rab27a was
an excellent candidate since it is abundant in melanocytes and platelets, and is
a substrate for prenylation by Rab GGTase. Immunoblots disclosed a single
Rab27a band of 27.5-kD in normal platelets and bone marrow, but an additional
mobility-shifted (28.5-kD) isoform in gm/gm (Figure 4.7). This isoform may
represent unprenylated Rab27a since the electrophoretic mobility of unmodified
Rab proteins is slower than that of prenylated forms in SDS-containing gels. Not all
small GTPases showed abnormal mobility: no differences between gm/gm and
+/+ platelets were observed on immunoblots of Rab4 or Ras.
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Figure 4.7: Deficient prenylation of Rab27a in gm platelets. Western blots of
wild-type (+/+), heterozygous (gm/+), and mutant (gm/gm) platelets and bone
marrow incubated with Rab27a and β-actin antibodies identify a shift in the
gm/gm  Rab27a  lanes.  This  indicates  hypoprenylation  of  Rab27a  in  gm/gm 
platelets.  Platelets  from  two  different  mice  from  each  genotype  were  analyzed 
for  consistency. 
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Additional  evidence  for  Rab27a  hypoprenylation  in  gm  was  sought  by 
examining  gm  platelets  for  membrane-associated  Rab27a,  since  prenylation  is 
required for membrane binding of Rab27a. In analysis of subcellular protein
fractions of platelets, Rab27a was mainly membrane-associated in normal
platelets, but predominantly in the soluble fraction of gm platelets (Figure 4.8).
GTP overlay studies revealed that the distribution and mobility of the abnormal
Rab27a band matched one of the previously described novel gm GTP binding
proteins (Figure 1.3 and Figure 4.8). Finally, immunohistochemical studies
demonstrated punctate distribution of Rab27a in normal mouse platelets,
consistent with localization on intracellular granules (Figure 4.9). In contrast,
gm/gm  platelets  exhibited  more  diffuse  cytoplasmic  Rab27a  staining  (Figure  4.9). 
This  evidence  suggests  that  Rab  GGTase  deficiency  in  gm  platelets  results  in 
deficient  Rab27a  prenylation.  Rab27a  may  be  particularly  affected  in  gm 
platelets  because  it  has  a low  affinity  for  the  REP2-containing  Rab  GGTase 
holoenzyme,  and  competes  poorly  with  other  prenylation  substrates  when  Rab 
GGTase is limited.
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Figure 4.8: Deficient prenylation of Rab27a in gm platelets. Western blots of
subcellular fractions of wild-type (+/+) and mutant (gm/gm) platelets incubated
with radiolabelled GTP or Rab27a antibodies. The subcellular fractions were
total protein (T), membrane-associated protein (M), and soluble-fraction protein
(S).




Figure  4.9:  Differential  intracellular  localization  of  Rab27a  in  gm  platelets 
compared to wild-type platelets. Photomicrographs (500X) of +/+ and gm/gm
platelets  stained  with  Rab27a  antibodies,  and  detected  with  an  FITC-conjugated 
secondary  antibody. 
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In  this  chapter  I have  described  additional  genetic  and  physical  mapping 
to narrow the gm critical region and localize candidate genes relative to the gm
locus. Rabggta was analyzed using genetic and biochemical assays, and was
shown to be the causative gene of gm. The facts that (1) this mutation allows
leaky expression of Rabggta, and (2) only this one gm allele exists, suggest that
homozygosity for completely null or inactivating Rabggta mutations might be
lethal, and that the gm mouse produces enough Rabggta to allow viability. As
discussed  in  the  next  chapter,  the  discovery  that  deficient  Rab  GGTase  causes 
gm  is  of  considerable  significance  for  the  study  of  platelet  biology  and  other 
hematological  conditions. 


CHAPTER  5 

CONCLUSIONS  AND  FUTURE  DIRECTIONS 


Despite  numerous  studies,  relatively  few  molecular  mechanisms  or 
biochemical  pathways  regulating  platelet  production  by  megakaryocytes  are 
known.  This  is  a significant  deficiency,  since  inherited  and  acquired  platelet 
defects  are  not  uncommon,  and  currently  are  treated  with  repeated  platelet 
transfusions,  which  are  very  costly,  may  transmit  infectious  diseases,  and  may 
result in alloimmunization. Identification of the molecular basis of Mendelian
mouse  models  of  platelet  defects  is  an  effective  way  to  identify  new  and 
important  pathways  or  factors  involving  platelet  production  from  megakaryocytes. 
Therefore,  the  main  objective  of  this  dissertation  work  was  to  positionally  clone 
and  characterize  the  gene  responsible  for  the  mouse  mutation  gunmetal,  in  order 
to  gain  insights  into  the  biological  processes  involving  hematopoiesis.  Five 
specific  aims  were  established  in  order  to  achieve  this  objective: 

1. Create a precise genetic map of mouse Chr 14 in the vicinity of gm.

2.  Generate  a physical  map  spanning  the  gm  critical  region. 

3.  Identify  gm  candidate  genes  based  on  location,  function,  & expression. 

4.  Screen  gm  candidate  cDNAs  for  mutations. 

5.  Characterize  the  gm  gene. 

Below,  the  results  for  each  specific  aim  are  summarized,  followed  by  final 
discussion  and  future  directions. 
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Summary  of  Results 

Create a Precise Genetic Map of Mouse Chr 14 in the Vicinity of gm

Three  large  gm  backcrosses  were  constructed  at  the  outset  of  this  study, 
resulting  in  2462  meioses.  Twenty-nine  polymorphic  loci  (some  derived  as 
described  below)  in  the  vicinity  of  gm  were  genotyped  in  the  three  backcrosses, 
thus  narrowing  the  genetic  interval  to  0.64  cM  between  D14Mit63  and  D14Mit122 
(Figures 4.1, 4.2, and 4.3). Three crosses were utilized to minimize the effects
of  observed  polymorphism  deficiencies  and  recombination  cold  spots. 
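The mapping resolution afforded by the three backcrosses can be made concrete with a short calculation. The sketch below sums the backcross sizes given in the Figure 4.3 legend and computes the one-sided 95% upper bound on the gm-Rabggta distance implied by observing zero recombinants. It assumes independent meioses and treats recombination fraction and map distance as equivalent, which is reasonable only for very small intervals. The function name is illustrative.

```python
# Backcross sizes (number of progeny, one informative meiosis each), from Figure 4.3.
BACKCROSS_MEIOSES = {"SPRET": 769, "PWK": 727, "DBA": 966}


def zero_recombinant_upper_bound_cM(n_meioses, confidence=0.95):
    """Upper confidence bound (in cM) on distance when 0 recombinants are seen.

    Solves (1 - r)**n = 1 - confidence for the recombination fraction r.
    """
    alpha = 1.0 - confidence
    r_max = 1.0 - alpha ** (1.0 / n_meioses)
    return 100.0 * r_max  # 1% recombination ~ 1 cM over short distances


total_meioses = sum(BACKCROSS_MEIOSES.values())
bound_cM = zero_recombinant_upper_bound_cM(total_meioses)
print(f"total meioses: {total_meioses}")
print(f"95% upper bound on gm-Rabggta distance: {bound_cM:.3f} cM")
```

With 2462 meioses and no recombinants, the bound is on the order of a tenth of a centiMorgan, which illustrates why pooling three crosses tightens the genetic localization.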

Generate a Physical Map Spanning the gm Critical Region

A consensus genetic map of the gm critical region (Figure 4.3a) was
constructed  from  the  backcross  haplotype  data  (Figure  4.1).  Genetic  markers 
flanking  and  within  the  critical  region  were  used  to  isolate  contiguous  YAC  and 
BAC  clones  that  encompass  gm.  To  complete  the  cloning  of  the  gm  interval, 
novel  polymorphic  markers  (SSLPs)  and  sequence  tagged  sites  (STSs)  were 
isolated using interspersed repetitive element (IRE)-PCR.

IRE-PCR is a useful method for identification of novel human or mouse
STSs from contigs of genomic clones (Figure 2.2). I used IRE-PCR with mouse
B1 repetitive element primers to generate novel, PCR-amplifiable, simple
sequence length polymorphisms (SSLPs) from yeast artificial chromosome
(YAC) clones containing regions of mouse chromosomes 14 (see Chapter 2,
Table 2.1). Twenty-nine out of 42 IRE-PCR clones contained multiple simple
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sequence  repeat  units.  PCR  analysis  with  primers  derived  from  unique 
sequences  flanking  the  simple  sequence  repeat  units  in  seven  clones  showed  all 
to  be  polymorphic  between  various  mouse  strains.  This  novel  approach  to  SSLP 
identification  represents  an  efficient  method  for  saturating  a genomic  interval  with 
polymorphic  genetic  markers  that  may  expedite  (as  it  did  in  this  project)  the 
positional  cloning  of  genes  for  traits  and  diseases. 
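Screening sequenced IRE-PCR clones for simple sequence repeat units is easy to automate. The following is a minimal sketch, not the procedure used in this work: it reports runs of a 2- to 4-nucleotide unit repeated at least a chosen number of times. The threshold of six repeats and the example clone sequence are assumptions made for illustration.

```python
import re


def find_simple_repeats(seq, min_repeats=6, unit_lengths=(2, 3, 4)):
    """Return (start, unit, repeat_count) for tandem repeats of short units."""
    seq = seq.upper()
    hits = []
    for k in unit_lengths:
        # A k-nucleotide unit followed by at least (min_repeats - 1) more copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, min_repeats - 1))
        for m in pattern.finditer(seq):
            unit = m.group(1)
            # Skip units that are a shorter unit repeated (e.g. "CACA" or "AA").
            if any(unit == unit[:d] * (k // d) for d in range(1, k) if k % d == 0):
                continue
            hits.append((m.start(), unit, len(m.group(0)) // k))
    return sorted(hits)


# Hypothetical clone sequence containing a (CA)12 repeat flanked by unique sequence.
toy_clone = "GGATC" + "CA" * 12 + "TTGAC"
repeats = find_simple_repeats(toy_clone)
print(repeats)
```

Primers for SSLP typing would then be designed in the unique sequence on either side of each reported repeat.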

Identify gm Candidate Genes Based on Location, Function, and Expression

After  an  overlapping  physical  contig  of  the  gm  critical  region  was 
constructed,  I used  direct  cDNA  selection  to  isolate  and  identify  the  genes  that 
lie within the YAC and BAC clones of the contig (see Chapter 3). Direct cDNA
selection  is  a powerful  method  to  identify  candidate  genes  in  a defined  genomic 
region  for  positional  cloning  projects  (Figure  3.1). 

To  overcome  some  of  the  limitations  associated  with  isolating  genomic 
template  DNA  for  the  direct  selection  procedure,  I used  IRE-PCR  with 
biotinylated  mouse  B1  and  B2  repetitive  element  primers  to  rapidly  generate 
uncontaminated  genomic  template  DNA  from  yeast  artificial  chromosome  (YAC) 
clones  (Figure  2.2  and  3.1).  The  resulting  genomic  template  DNA  was 
successfully  used  in  the  direct  cDNA  selection  procedure  described  by  Del 
Mastro  and  Lovett,  resulting  in  identification  of  5 known  genes  and  21  novel 
ESTs (Table 3.1). One of the genes isolated (Rabggta) was later determined to
be the causative gene of gm (see Chapter 4), illustrating that our novel
modification  of  the  template  preparation  procedure  for  direct  cDNA  selection  is 
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effective  and  may  help  expedite  (as  it  did  in  this  project)  positional  cloning  of 
disease  genes. 

Screen gm Candidate cDNAs for Mutations

Several  genes  were  screened  for  mutations  in  gm  mice.  Genomic 
sequence  of  Rab  geranylgeranyl  transferase,  alpha-subunit  (Rabggta)  revealed 
a G→A point mutation in the last nucleotide of the 3’ splice acceptor in intron a
of gm/gm DNA when compared to +/+ DNA (see Chapter 4, Figure 4.4a, c). As
expected, RT-PCR of gm/gm mRNA showed that substitution of this obligatory
3’-terminal guanine of the splice acceptor site prevented normal splicing and led
to skipping of exon 1 (which contains the start codon), intron a retention, and
cryptic splice site utilization (Figure 4.4a, b).

Consistent with the substantial loss of exon 1 and its start codon during Rabggta
splicing in gm/gm mRNA, a significant deficiency (~70% loss) of Rabggta protein
in gm/gm platelets was identified on Western blots (Figure 4.5). This protein
deficiency correlated with an ~70% reduction in Rab GGTase activity in gm/gm
mice (Figure 4.6). This substantial loss of Rab GGTase prenylation activity
points to an explanation of the hematologic defects associated with the gm
mouse and may lead to a better understanding of platelet biogenesis (see
below).
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Characterize the gm Gene

Rabggta  not  only  fits  as  the  causative  gene  of  gm  based  on  its  genetic 
location  and  identified  point  mutation,  but  also  based  on  its  function.  Rabggta  is 
the  alpha  subunit  of  Rab  GGTase.  Rab  GGTase  is  a heterodimer  composed  of 
tightly bound α- and β-subunits. It is inactive on its own, requiring another
component, Rab Escort Protein (REP), for activation so that it may transfer two
20-carbon geranylgeranyl groups to Rab proteins (Rab GTPases) that bear
double cysteine motifs at their COOH-terminus. Rab GTPases then interact
with  membranes  to  regulate  vesicular  transport  steps  in  endocytic  and  secretory 
pathways  by  acting  as  molecular  switches  oscillating  between  active  GTP-bound 
and  inactive  GDP-bound  states  (Figure  5.1). 


Figure  5.1:  Diagrammatic  representation  of  Rabggta’s  role  in  Rab  prenylation 
and membrane attachment. Diamonds (g) represent geranylgeranyl groups.
Rab GGTase components are marked α, β, and REP-1. Top arrow (+/+)
represents normal prenylation of Rab27a. Bottom two dashed arrows (gm/gm)
represent  reduced  prenylation  of  Rab27a  due  to  substantial  deficiency  of  the 
Rabggta  subunit. 
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Discussion 

Most gm hematologic defects are explicable on the basis of impaired
prenylation by Rab GGTase of substrates, such as Rab27a, ablating the ability of
these molecules to attach to membranes or regulate intracellular protein
transport. Rab proteins regulate intracellular vesicular transport by
cycling between membrane-attached, GTP-bound and cytosolic, GDP-bound
states. Intracellular protein trafficking and vesicular transport are
disordered in gm. Proteins destined for delivery from the trans-Golgi network
to α- and δ-granules are misrouted in gm platelets. As a result, gm platelet
α-granules lack fibrinogen, von Willebrand factor (vWF) and platelet factor 4. vWF
is secreted from gm megakaryocytes instead of being transported to
α-granules.

In addition to qualitative defects, the number of platelet α- and δ-granules
and platelet synthesis are decreased in gm mice. Platelet synthesis is a complex
process  involving  generation  of  large  membrane  complexes  within 
megakaryocytes.  The  distribution  of  these  membrane  complexes  is  abnormal  in 
gm  megakaryocytes,  and  gm  platelets  are  less  common,  larger  and  more 
heterogeneous  in  size  than  normal. Importantly,  the  association  of  Rab 
GGTase  with  maintenance  of  normal  platelet  count  and  bleeding  time  implies 
that  Rab  GGTase  is  a therapeutic  target  for  thrombocytosis  and  clotting 
disorders  (such  as  myocardial  infarction,  stroke,  or  deep  venous  thrombosis). 
Compounds that selectively inhibit Rab GGTase have been developed and
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may  be  effective  in  these  disorders  if  dosed  to  mimic  gm  by  reducing  platelet 
Rab  GGTase  activity  by  ~70%,  and  thus  reducing  overall  clotting  ability. 

The molecular pathology of gm is strikingly similar to that of the X-linked
human disorder choroideremia (CHM). In both, mutated genes encoding
subunits of Rab GGTase result in partial enzyme deficiency: Rabggta is mutated
in gm and Rab geranylgeranyl transferase component A (REP1) is mutated in
CHM. Retention of partial Rab GGTase activity in gm probably reflects
aberrant splicing that rescues certain Rabggta transcripts. In CHM, partial
activity remains because two redundant loci encode Component A of Rab
GGTase (REP1 and REP2). In both, a consequence of partial deficiency of
Rab GGTase is decreased Rab27a prenylation, as evidenced by change in
mobility and membrane association. The phenotype of gm, however, is rather
different from CHM. Melanocytes are abnormal in both: coat color is diluted in
gm, and retinal melanocytes are one of the first cell types affected in CHM.
However,  albinism  does  not  occur  in  CHM,  nor  have  retinal  abnormalities  been 
observed  in  gm.  Furthermore,  CHM  patients  do  not  have  prolonged  bleeding 
time  or  platelet  storage  pool  deficiency.  These  differences  may  reflect  genetic 
complementation  by  REP2  in  CHM  platelets,  but  not  gm.  They  may  also  reflect 
species  differences,  as  suggested  by  phenotypic  differences  between  CHM 
patients and Chm knockout mice.

The human genetic disorders most similar to gm are gray platelet
syndrome (GPS) and α/δ-SPD. Like gm platelets, GPS platelets are reduced in
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number, increased in size, and lack α-granule proteins. Mis-sorting and
inappropriate secretion of α-granule proteins occur in GPS, as in gm. GPS
patients, however, do not exhibit δ-granule storage pool deficiency or partial
albinism. Like gm, α/δ-SPD is characterized by reduction in the number and
contents of both α- and δ-granules. Thus, RABGGTA is an excellent candidate
gene for GPS and α/δ-SPD.

Future  Directions 

An  important  initiative  is  to  identify  the  human  disease(s)  caused  by 
mutations  in  RABGGTA  (if  any).  This  is  often  the  ultimate  goal  of  positionally 
cloning  disease  genes  using  mouse  models.  As  mentioned  above,  GPS  and 
α/δ-SPD are good human candidate diseases for RABGGTA mutations.
Currently,  patient  blood  samples  are  being  sought  for  further  analysis.  If 
possible,  both  RNA  and  DNA  will  be  isolated  from  the  patient  samples  to  be  used 
in  RT-PCR,  Southern,  Northern,  and  Western  analyses.  Additionally,  direct 
sequence  analysis  will  be  done  on  both  cDNA  and  DNA,  including  all  intron/exon 
boundaries. Depending on the results of RT-PCR and other preliminary tests,
some of these experiments may prove unnecessary. A favorable
outcome of these analyses may lead to better diagnostics, treatments, and
understanding  of  hematological  diseases.  However,  since  it  is  likely  that  a gm 
null  mutant  would  be  lethal  (based  on  my  observation  that  the  gm  mutation  is 
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leaky),  I suspect  this  to  be  the  case  in  humans  as  well,  and  thus  it  would  not  be 
unexpected  to  find  the  human  mutation  to  be  leaky  as  it  is  in  gm. 

If  a mutation  in  RABGGTA  is  found  to  cause  a hematological  disease  in 
humans (such as GPS or α/δ-SPD), it is plausible that gene therapy could be
used for treatment. Experimental evidence that reciprocal bone marrow
transplants between gm/gm and gm/+ mice correct or cause the mutant
phenotype suggests that clinical abnormalities can be permanently corrected in
patients with these conditions. Since patient age (these conditions are not
immediately life threatening) and cDNA size (only 2.4 kb) do not appear to be
limiting factors, it may be feasible to remove bone marrow (relatively easy to
isolate and manipulate) from an affected human patient, transfect its stem cells
with a construct that produces normal RABGGTA, and then
transplant it back into the patient. Such a gene therapy approach could be
tested  and  perfected  using  gm  mice  first. 

Another  future  direction  for  the  gunmetal  project  involves  derivation  of 
mouse  transgenics  to  test  whether  re-introduction  of  Rabggta  corrects  the  gm 
phenotype. At present, experiments to introduce cDNA and BAC transgenes of
the  normal  Rabggta  gene  into  gm  mice  are  in  their  early  stages.  With  the 
assistance  of  collaborators  at  Baylor  College  of  Medicine,  we  will  test  whether 
the  gm  phenotype  in  recessive  gm  mice  is  corrected  by  creating  heterozygous 
transgenics carrying a normal copy of Rabggta. Both cDNA and BAC strategies
will  be  attempted,  to  overcome  the  limitations  associated  with  each  type  of 
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transgene.  Currently,  a full-length  (5’UTR  through  3’UTR)  Rabggta  cDNA 
product  has  been  cloned  (using  TOPO  TA  kit)  and  sequenced  (no  point 
mutations  present).  The  next  step  will  be  to  clone  the  Rabggta  cDNA  into  a 
vector  with  a strong  universal  eukaryotic  promoter  (e.g.  pCAGGS).®°  The 
construct  will  be  sent  to  our  collaborators  at  Baylor,  who  already  have 
established  the  gm  mouse  colony  on  site.  They  will  inject  the  cDNA  construct 
into  gm/gm  fertilized  oocytes,  and  then  transfer  these  oocytes  into  the  uterus  of 
a pseudo-pregnant  wild-type  mouse.  All  progeny  will  be  gm  homozygotes  at  the 
original  locus,  but  will  not  have  the  diluted  coat  color  (or  mutant  platelet 
phenotype)  if  the  transgene  has  incorporated  correctly  and  is  producing  sufficient 
Rabggta.  As  a caveat,  over-expression  of  the  cDNA  transgene  could  cause 
unanticipated  problems  in  scoring  phenotype.  Thus,  BAG  transgenics  may  also 
be  utilized  to  test  the  effects  of  the  gene  under  its  normal  expression  controls. 
BACs,  however,  are  more  difficult  to  manipulate  due  to  their  large  size,  and  the 
possibility  exists  that  they  may  not  contain  the  complete  promoter  and/or 
enhancer  systems  necessary  for  normal  expression.  Considering  all  such 
factors,  a transgenic  result  of  an  abnormal  phenotype  does  not  dis-prove  that 
Rabggta  is  the  gm  gene.  However,  if  it  is  successful,  it  would  provide  final  proof 
that  Rabggta  is  the  gene  that  causes  gm.  Furthermore,  even  if  complete 
phenotype  correction  is  not  seen,  these  experiments  may  still  produce  valuable 
information  concerning  protein  over-expression  or  mis-expression. 

Another future project related to this work concerns the beta subunit
(Rabggtb) of the Rab GGTase heterodimer. At present, no human or mouse
diseases have been reported to be linked to mutations in Rabggtb. However, since
two of the three components of Rab GGTase have been found to cause disease
when perturbed (REP-1 causes CHM and Rabggta causes gm),^36,42 it is
perfectly reasonable that the third component (Rabggtb) also causes
disease when mutated.

Initial mapping has localized RABGGTB to human 1p31 and mouse
Chr 3, cM 76.5. Interestingly, also localized to mouse Chr 3 (cM 74.8) is the
semidominant mouse mutation varitint-waddler (Va). Heterozygous mutants
exhibit reduced fertility, deafness, and minor head-tossing and hyperactivity.
Homozygous mutants exhibit severe infertility, high mortality, and intense
behavioral abnormalities. Furthermore, like gm homozygotes, the Va mutants
express coat color dilutions as well. Heterozygous coats are variegated with
patches of normal-colored, diluted, and white fur. Homozygotes show more
dilute colors; their coats are white, except for small patches of unaltered color
near the ears and base of the tail. The pathological changes in heterozygotes
include degeneration of the organ of Corti, stria vascularis, spiral ganglion,
saccular macula, cristae ampullares, and vestibular ganglion. In homozygotes,
the degenerative changes are more severe and also include the utricular
macula.

Although the characteristics associated with Va do not parallel those of
gm, neither are gm characteristics similar to the features of CHM. Furthermore,
Rab3a (which is known to be modified by Rab GGTase) has been associated
with neurological development, since it controls exocytosis in neuroendocrine and
neuronal cells. For these reasons, it is plausible that a mutation in Rabggtb
could cause Va. A positional candidate approach could be used to investigate
this disease in the mouse model, as well as other possible candidate diseases
that may exist in both humans and mice. Currently, only Va homozygous DNA
is available from Jackson Labs, as the mouse (C57BL/6J-Va) exists only as a
frozen embryo. At a cost of $500, Jackson Labs could provide these mice within
3 months. During the 3-month waiting period, preliminary Southern analysis
could be done using Va/Va, Va/+, and +/+ DNA, hybridized to a Rabggtb probe.
Once RNA becomes available from the mice, RT-PCR and Northern analysis
could be done to further assay for mutations. Additionally, since Rabggtb is
relatively small (1011 bp coding region), sequence analysis could be achieved
quickly and at a fairly low cost. Should these analyses fail to identify a mutation,
a more conventional positional cloning approach could be undertaken if so
desired.


APPENDIX  A 

MOUSE  RABGGTA  SEQUENCES  SUBMITTED  TO  GENBANK 


GenBank Submission AF127654

Submission 1 of a set of 9 submission(s).

Comment: Intron alpha (bp 125-276), Intron 1 (bp 334-471), bp 276 contains (g)
which is mutated to (a) in the gunmetal mutant strain (sequence will be submitted
for this in the next submission).


LOCUS       XXXXX  2685 bp  DNA  ROD  11-FEB-1999
DEFINITION  Mus musculus RAB geranylgeranyl transferase, alpha subunit
            (Rabggta) DNA, first 2 introns and all cds exons.
ACCESSION
KEYWORDS
SOURCE      house mouse.
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria;
            Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE   1 (bases 1 to 2685)
  AUTHORS   Detter,J.C., Novack,E.K., Seabra,M, Swank,R.T. and Kingsmore,S.F.
  TITLE     Abnormal Protein Prenylation Leads to Hypopigmentation/Platelet
            Storage Pool Deficiency in the Gunmetal Mouse Mutant
  JOURNAL   Unpublished
REFERENCE   2 (bases 1 to 2685)
  AUTHORS   Detter,J.C.
  TITLE     Direct Submission
  JOURNAL   Submitted (11-FEB-1999) Molecular Genetics & Microbiology, and
            Medicine, University of Florida, 1600 SW Archer Rd, Gainesville, FL
            32610, USA
FEATURES             Location/Qualifiers
     source          1..2685
                     /organism="Mus musculus"
                     /strain="C57BL/6J"
                     /db_xref="taxon:10090"
                     /chromosome="14"
                     /map="between Mit63 and Mit122"

BASE  COUNT  544  a 771  c 758  g 612  t 

ORIGIN 

1 gtgcaagggt  ccacgggacg  gacggacggg  cgggcgggca  ctgctctcta  tatgcgcggc 

61  gtgtgcctcg  gccctgacgg  gtggggcttc  ggaaagatcc  gtggggacgg  aggccgtggt 

121  tgtggtgagc  gcttcctcct  gacctcacag  tgccagccgc  cggccccctt  ttaactcacg 

181  gggatcctta  tcctgggcct  gctggcagct  tccgggaaac  acctgggggg  cggggccgtg 

241  tgggcggggt  tctgtgattg  acatctcttt  cggcagattg  aacgggctgc  ctccggaccc 

301  ctcccctatc  gccaacgctg  agccgaaacc  atggtgagaa  ccctgagctc  agcccctttc 

361  ccctgcatgc  ccatgccctc  atcccacgtg  gctatctcgg  ccgggcttct  gcaagactgc 

421  ccttgcgagg  caaccggcgt  tcatattatc  ctatcctcat  atccctggta  gcatgggcgc 

481  ctgaaggtga  agacgtccga  agagcaggca  gaggccaaaa  ggctagagcg  ggaacagaag 

541  ctaaagctct  accagtcagc  cactcaagct  gtcttccaga  agcgggaggc  aggcgagctg 

601  gatgaatccg  tcctagaact  gacaagccag  attctgggag  ccaaccctga  ttttgccacc 

661  ctctggaact  gtcgcagaga  agtgctccag  cagctagaaa  cccagaagtc  ccctgaggag 

721  ttggctgctc  ttgtgaaggc  agaactaggc  ttccttgaga  gctgtctgcg  tgtgaaccct 


781  aagtcctatg  gcacttggca  ccaccgctgc  tggctgctga  gtcgcctgcc  tgagcccaac

841  tgggcccggg  agctggagct  gtgcgctcgc  ttcctcgagg  ccgatgagcg  gaactttcat

901  tgctgggact  atcggcgatt  tgtagctgca  caggctgctg  tcgcgcctgc  agaggaacta

961  gccttcactg  acagcctcat  cacccggaac  ttctccaact  attcttcctg  gcattatcgc

1021  tcctgcctct  tgccccagct  gcacccccag  ccagactctg  gcccccaggg  acggctccct

1081  gaaaatgtac  tgctgagaga  gctggagttg  gtgcagaatg  ccttctttac  tgaccccaat

1141  gatcagagtg  cctggttcta  tcaccgctgg  ctcctggggc  gggcggagcc  ccacgatgtt

1201  ctgtgctgcc  tgcatgtgag  ccgggaagag  gcctgtctgt  cagtctgctt  ttctcgtccc

1261  ctgatagtgg  gctccaagat  ggggaccttg  ctactcacgg  ttgacgaggc  acctttgagt

1321  gtggaatgga  ggactccaga  tggcaggaac  cggcccagcc  atgtctggct  ctgtgacctg

1381  cctgctgcct  ctctcaatga  ccacttgccc  cagcacacgt  ttcgtgtcat  ctggacagga

1441  agcgataccc  agaaggagtg  tgtgctttta  aaaggtcacc  aggagtgctg  gtgccgagac

1501  tcagccactg  atgaacagtt  gttcaggtgt  gagctgtccg  tggagaagtc  cacagtgcta

1561  cagtccgagc  ttgagtcctg  taaggagctg  caggaactgg  agcctgagaa  taaatggtgc

1621  ctgctgacca  tcatcctcct  gatgagggca  ctggaccccc  tcctctacga  gaaagaaaca

1681  ctggagtact  tcagtaccct  caaagctgtg  gacccgatga  gagcagccta  cttggacgac

1741  ctgcgcagca  agttcttggt  ggaaaacagt  gttctcaaga  tggagtatgc  tgacgtgcgt

1801  gtgctgcacc  tggctcacaa  ggatctcaca  gtgctctgcc  atttggaaca  actgctcttg

1861  gtcactcatc  ttgacctgtc  ccataatcgt  ctccgagcct  tgcccccagc  cctggctgct

1921  ctgcgctgtc  tcgaggtgtt  gcaggccagt  gataatgtcc  tggagaactt  ggacggcgtg

1981  gctaaccttc  cccggctgcg  ggagctgtta  ctatgtaata  accgcctcca  gcagtctgct

2041  gcacttcaga  ctcttgcctc  ctgccccagg  ctagtcttcc  tcaacctgca  gggcaactcg

2101  ctgtgccaag  aagagggcat  ccgggagcgt  ttggctgaga  tgctaccgtc  cgtcagcagc

2161  atcctcacct  aggaggcccc  atctcaccct  caccctttaa  cttattggga  ctgaataaag

2221  actggagagg  cccctctagg  ctgccaagct  accattactg  ctgccctgac  aacacttcct

2281  ccattaccac  cactgtgcat  tttggaaaaa  ggaagggagg  ggctctctgg  ggtatggggc

2341  cactgtcctg  tgattctgtc  taccaagacc  ctcagtgttt  atggctggtc  aggaccaggg

2401  ctgggtctga  gttcaccatc  agttttgggc  tatcgtggtg  cctgagcttg  aagctgggat

2461  aggaattagg  atcaggttgt  ggtttggact  gctcatcaga  agagctgtct  cggggctgga

2521  gagatggctc  agcggttaag  agcgctgact  gttcttctga  aagtcctgag  ttcaaatccc

2581  agcaaccaca  tggtggctca  taaccatctg  taataagatc  tgacaccctc  ttctggtgtc

2641  tgaagacagc  tacagtgtac  ttacatataa  taataaataa  atctt

GenBank Submission AF127655

Submission 2 of a set of 9 submission(s).

Comment: bp 276 contains the pt mutation (a). This mutation is in the
first intron at the 3' acceptor. My previously submitted sequence contains the
correct nucleotide (g).
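
The comment above can be checked directly against the two deposited sequences. The following is a minimal Python sketch, not part of the original submission: it takes the rows covering bp 241-300 from the ORIGIN sections of the wild-type (AF127654) and gunmetal (AF127655) genomic records and reports the first position at which they differ. The helper name `first_difference` is hypothetical and introduced only for illustration.

```python
# Rows covering bp 241-300, copied from the ORIGIN sections of the
# wild-type (AF127654) and gunmetal (AF127655) genomic records.
ROW_START = 241
WT_ROW = "tgggcggggt tctgtgattg acatctcttt cggcagattg aacgggctgc ctccggaccc"
GM_ROW = "tgggcggggt tctgtgattg acatctcttt cggcaaattg aacgggctgc ctccggaccc"


def first_difference(wt_row, gm_row, row_start):
    """Return (position, wt_base, gm_base) for the first mismatch, or None.

    Hypothetical helper: positions are 1-based, as in the GenBank records.
    """
    wt = wt_row.replace(" ", "")
    gm = gm_row.replace(" ", "")
    for offset, (a, b) in enumerate(zip(wt, gm)):
        if a != b:
            return row_start + offset, a, b
    return None


position, wt_base, gm_base = first_difference(WT_ROW, GM_ROW, ROW_START)
print(f"bp {position}: {wt_base} -> {gm_base}")
```

The same single g-to-a change is consistent with the BASE COUNT lines of the two records, which differ by exactly one a and one g (544 a and 758 g in the wild type, 545 a and 757 g in the mutant).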


LOCUS       XXXXX  2685 bp  DNA  ROD  11-FEB-1999
DEFINITION  Mus musculus (with gunmetal mutation) RAB geranylgeranyl
            transferase, alpha subunit (Rabggta), first two introns and all cds
            exons.
ACCESSION
KEYWORDS
SOURCE      house mouse.
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria;
            Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE   1 (bases 1 to 2685)
  AUTHORS   Detter,J.C., Novack,E.K., Seabra,M, Swank,R.T. and Kingsmore,S.F.
  TITLE     Abnormal Protein Prenylation Leads to Hypopigmentation/Platelet
            Storage Pool Deficiency in the Gunmetal Mouse Mutant
  JOURNAL   Unpublished
REFERENCE   2 (bases 1 to 2685)
  AUTHORS   Detter,J.C.
  TITLE     Direct Submission
  JOURNAL   Submitted (11-FEB-1999) Molecular Genetics & Microbiology, and
            Medicine, University of Florida, 1600 SW Archer Rd, Gainesville, FL
            32610, USA
FEATURES             Location/Qualifiers
     source          1..2685
                     /organism="Mus musculus"
                     /strain="C57BL/6J-gm/gm"
                     /db_xref="taxon:10090"
                     /chromosome="14"
                     /map="between Mit63 and Mit122"

BASE  COUNT  545  a 771  c 757  g 612  t 

ORIGIN 

1 gtgcaagggt  ccacgggacg  gacggacggg  cgggcgggca  ctgctctcta  tatgcgcggc 

61  gtgtgcctcg  gccctgacgg  gtggggcttc  ggaaagatcc  gtggggacgg  aggccgtggt 

121  tgtggtgagc  gcttcctcct  gacctcacag  tgccagccgc  cggccccctt  ttaactcacg 

181  gggatcctta  tcctgggcct  gctggcagct  tccgggaaac  acctgggggg  cggggccgtg 

241  tgggcggggt  tctgtgattg  acatctcttt  cggcaaattg  aacgggctgc  ctccggaccc 

301  ctcccctatc  gccaacgctg  agccgaaacc  atggtgagaa  ccctgagctc  agcccctttc 

361  ccctgcatgc  ccatgccctc  atcccacgtg  gctatctcgg  ccgggcttct  gcaagactgc 

421  ccttgcgagg  caaccggcgt  tcatattatc  ctatcctcat  atccctggta  gcatgggcgc 

481  ctgaaggtga  agacgtccga  agagcaggca  gaggccaaaa  ggctagagcg  ggaacagaag 

541  ctaaagctct  accagtcagc  cactcaagct  gtcttccaga  agcgggaggc  aggcgagctg 

601  gatgaatccg  tcctagaact  gacaagccag  attctgggag  ccaaccctga  ttttgccacc 

661  ctctggaact  gtcgcagaga  agtgctccag  cagctagaaa  cccagaagtc  ccctgaggag 

721  ttggctgctc  ttgtgaaggc  agaactaggc  ttccttgaga  gctgtctgcg  tgtgaaccct 

781  aagtcctatg  gcacttggca  ccaccgctgc  tggctgctga  gtcgcctgcc  tgagcccaac 

841  tgggcccggg  agctggagct  gtgcgctcgc  ttcctcgagg  ccgatgagcg  gaactttcat 

901  tgctgggact  atcggcgatt  tgtagctgca  caggctgctg  tcgcgcctgc  agaggaacta 

961  gccttcactg  acagcctcat  cacccggaac  ttctccaact  attcttcctg  gcattatcgc 

1021  tcctgcctct  tgccccagct  gcacccccag  ccagactctg  gcccccaggg  acggctccct 

1081  gaaaatgtac  tgctgagaga  gctggagttg  gtgcagaatg  ccttctttac  tgaccccaat 

1141  gatcagagtg  cctggttcta  tcaccgctgg  ctcctggggc  gggcggagcc  ccacgatgtt 

1201  ctgtgctgcc  tgcatgtgag  ccgggaagag  gcctgtctgt  cagtctgctt  ttctcgtccc 

1261  ctgatagtgg  gctccaagat  ggggaccttg  ctactcacgg  ttgacgaggc  acctttgagt 

1321  gtggaatgga  ggactccaga  tggcaggaac  cggcccagcc  atgtctggct  ctgtgacctg 

1381  cctgctgcct  ctctcaatga  ccacttgccc  cagcacacgt  ttcgtgtcat  ctggacagga 

1441  agcgataccc  agaaggagtg  tgtgctttta  aaaggtcacc  aggagtgctg  gtgccgagac 

1501  tcagccactg  atgaacagtt  gttcaggtgt  gagctgtccg  tggagaagtc  cacagtgcta 


1561  cagtccgagc  ttgagtcctg  taaggagctg  caggaactgg  agcctgagaa  taaatggtgc

1621  ctgctgacca  tcatcctcct  gatgagggca  ctggaccccc  tcctctacga  gaaagaaaca

1681  ctggagtact  tcagtaccct  caaagctgtg  gacccgatga  gagcagccta  cttggacgac

1741  ctgcgcagca  agttcttggt  ggaaaacagt  gttctcaaga  tggagtatgc  tgacgtgcgt

1801  gtgctgcacc  tggctcacaa  ggatctcaca  gtgctctgcc  atttggaaca  actgctcttg

1861  gtcactcatc  ttgacctgtc  ccataatcgt  ctccgagcct  tgcccccagc  cctggctgct

1921  ctgcgctgtc  tcgaggtgtt  gcaggccagt  gataatgtcc  tggagaactt  ggacggcgtg

1981  gctaaccttc  cccggctgcg  ggagctgtta  ctatgtaata  accgcctcca  gcagtctgct

2041  gcacttcaga  ctcttgcctc  ctgccccagg  ctagtcttcc  tcaacctgca  gggcaactcg

2101  ctgtgccaag  aagagggcat  ccgggagcgt  ttggctgaga  tgctaccgtc  cgtcagcagc

2161  atcctcacct  aggaggcccc  atctcaccct  caccctttaa  cttattggga  ctgaataaag

2221  actggagagg  cccctctagg  ctgccaagct  accattactg  ctgccctgac  aacacttcct

2281  ccattaccac  cactgtgcat  tttggaaaaa  ggaagggagg  ggctctctgg  ggtatggggc

2341  cactgtcctg  tgattctgtc  taccaagacc  ctcagtgttt  atggctggtc  aggaccaggg

2401  ctgggtctga  gttcaccatc  agttttgggc  tatcgtggtg  cctgagcttg  aagctgggat

2461  aggaattagg  atcaggttgt  ggtttggact  gctcatcaga  agagctgtct  cggggctgga

2521  gagatggctc  agcggttaag  agcgctgact  gttcttctga  aagtcctgag  ttcaaatccc

2581  agcaaccaca  tggtggctca  taaccatctg  taataagatc  tgacaccctc  ttctggtgtc

2641  tgaagacagc  tacagtgtac  ttacatataa  taataaataa  atctt

GenBank Submission AF127656

Submission 3 of a set of 9 submission(s).

Comment: Please keep all data CONFIDENTIAL until in press. Thanks!


LOCUS       XXXXX  2395 bp  mRNA  ROD  11-FEB-1999
DEFINITION  Mus musculus RAB geranylgeranyl transferase, alpha subunit
            (Rabggta) mRNA, complete cds.
ACCESSION
KEYWORDS
SOURCE      house mouse.
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria;
            Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE   1 (bases 1 to 2395)
  AUTHORS   Detter,J.C., Novack,E.K., Seabra,M, Swank,R.T. and Kingsmore,S.F.
  TITLE     Abnormal Protein Prenylation Leads to Hypopigmentation/Platelet
            Storage Pool Deficiency in the Gunmetal Mouse Mutant
  JOURNAL   Unpublished
REFERENCE   2 (bases 1 to 2395)
  AUTHORS   Detter,J.C.
  TITLE     Direct Submission
  JOURNAL   Submitted (11-FEB-1999) Molecular Genetics & Microbiology, and
            Medicine, University of Florida, 1600 SW Archer Rd, Gainesville, FL
            32610, USA
FEATURES             Location/Qualifiers
     source          1..2395
                     /organism="Mus musculus"
                     /strain="C57BL/6J"
                     /db_xref="taxon:10090"
                     /chromosome="14"
                     /map="between Mit63 and Mit122"
     mRNA            join(1..124,125..181,182..292,293..419,420..605,606..809,
                     810..891,892..979,980..1078,1079..1185,1186..1239,
                     1240..1326,1327..1416,1417..1532,1533..1647,1648..1733,
                     1734..2395)
                     /gene="Rabggta"
     gene            1..2395
                     /gene="Rabggta"
     CDS             179..1882
                     /gene="Rabggta"
                     /codon_start=1
                     /product="RABGGTA"
                     /translation="MHGRLKVKTSEEQAEAKRLEREQKLKLYQSATQAVFQKREAGEL
                     DESVLELTSQILGANPDFATLWNCRREVLQQLETQKSPEELAALVKAELGFLESCLRV
                     NPKSYGTWHHRCWLLSRLPEPNWARELELCARFLEADERNFHCWDYRRFVAAQAAVAP
                     AEELAFTDSLITRNFSNYSSWHYRSCLLPQLHPQPDSGPQGRLPENVLLRELELVQNA
                     FFTDPNDQSAWFYHRWLLGRAEPHDVLCCLHVSREEACLSVCFSRPLIVGSKMGTLLL
                     TVDEAPLSVEWRTPDGRNRPSHVWLCDLPAASLNDHLPQHTFRVIWTGSDTQKECVLL
                     KGHQECWCRDSATDEQLFRCELSVEKSTVLQSELESCKELQELEPENKWCLLTIILLM
                     RALDPLLYEKETLEYFSTLKAVDPMRAAYLDDLRSKFLVENSVLKMEYADVRVLHLAH
                     KDLTVLCHLEQLLLVTHLDLSHNRLRALPPALAALRCLEVLQASDNVLENLDGVANLP
                     RLRELLLCNNRLQQSAALQTLASCPRLVFLNLQGNSLCQEEGIRERLAEMLPSVSSIL
                     T"
BASE COUNT  502 a 672 c 678 g 543 t
ORIGIN


1 gtgcaagggt  ccacgggacg  gacggacggg  cgggcgggca  ctgctctcta  tatgcgcggc 

61  gtgtgcctcg  gccctgacgg  gtggggcttc  ggaaagatcc  gtggggacgg  aggccgtggt 

121  tgtgattgaa  cgggctgcct  ccggacccct  cccctatcgc  caacgctgag  ccgaaaccat 

181  gcatgggcgc  ctgaaggtga  agacgtccga  agagcaggca  gaggccaaaa  ggctagagcg 

241  ggaacagaag  ctaaagctct  accagtcagc  cactcaagct  gtcttccaga  agcgggaggc 

301  aggcgagctg  gatgaatccg  tcctagaact  gacaagccag  attctgggag  ccaaccctga 

361  ttttgccacc  ctctggaact  gtcgcagaga  agtgctccag  cagctagaaa  cccagaagtc 


421  ccctgaggag  ttggctgctc  ttgtgaaggc  agaactaggc  ttccttgaga  gctgtctgcg

481  tgtgaaccct  aagtcctatg  gcacttggca  ccaccgctgc  tggctgctga  gtcgcctgcc

541  tgagcccaac  tgggcccggg  agctggagct  gtgcgctcgc  ttcctcgagg  ccgatgagcg

601  gaactttcat  tgctgggact  atcggcgatt  tgtagctgca  caggctgctg  tcgcgcctgc

661  agaggaacta  gccttcactg  acagcctcat  cacccggaac  ttctccaact  attcttcctg

721  gcattatcgc  tcctgcctct  tgccccagct  gcacccccag  ccagactctg  gcccccaggg

781  acggctccct  gaaaatgtac  tgctgagaga  gctggagttg  gtgcagaatg  ccttctttac

841  tgaccccaat  gatcagagtg  cctggttcta  tcaccgctgg  ctcctggggc  gggcggagcc

901  ccacgatgtt  ctgtgctgcc  tgcatgtgag  ccgggaagag  gcctgtctgt  cagtctgctt

961  ttctcgtccc  ctgatagtgg  gctccaagat  ggggaccttg  ctactcacgg  ttgacgaggc

1021  acctttgagt  gtggaatgga  ggactccaga  tggcaggaac  cggcccagcc  atgtctggct

1081  ctgtgacctg  cctgctgcct  ctctcaatga  ccacttgccc  cagcacacgt  ttcgtgtcat

1141  ctggacagga  agcgataccc  agaaggagtg  tgtgctttta  aaaggtcacc  aggagtgctg

1201  gtgccgagac  tcagccactg  atgaacagtt  gttcaggtgt  gagctgtccg  tggagaagtc

1261  cacagtgcta  cagtccgagc  ttgagtcctg  taaggagctg  caggaactgg  agcctgagaa

1321  taaatggtgc  ctgctgacca  tcatcctcct  gatgagggca  ctggaccccc  tcctctacga

1381  gaaagaaaca  ctggagtact  tcagtaccct  caaagctgtg  gacccgatga  gagcagccta

1441  cttggacgac  ctgcgcagca  agttcttggt  ggaaaacagt  gttctcaaga  tggagtatgc

1501  tgacgtgcgt  gtgctgcacc  tggctcacaa  ggatctcaca  gtgctctgcc  atttggaaca

1561  actgctcttg  gtcactcatc  ttgacctgtc  ccataatcgt  ctccgagcct  tgcccccagc

1621  cctggctgct  ctgcgctgtc  tcgaggtgtt  gcaggccagt  gataatgtcc  tggagaactt

1681  ggacggcgtg  gctaaccttc  cccggctgcg  ggagctgtta  ctatgtaata  accgcctcca

1741  gcagtctgct  gcacttcaga  ctcttgcctc  ctgccccagg  ctagtcttcc  tcaacctgca

1801  gggcaactcg  ctgtgccaag  aagagggcat  ccgggagcgt  ttggctgaga  tgctaccgtc

1861  cgtcagcagc  atcctcacct  aggaggcccc  atctcaccct  caccctttaa  cttattggga

1921  ctgaataaag  actggagagg  cccctctagg  ctgccaagct  accattactg  ctgccctgac

1981  aacacttcct  ccattaccac  cactgtgcat  tttggaaaaa  ggaagggagg  ggctctctgg

2041  ggtatggggc  cactgtcctg  tgattctgtc  taccaagacc  ctcagtgttt  atggctggtc

2101  aggaccaggg  ctgggtctga  gttcaccatc  agttttgggc  tatcgtggtg  cctgagcttg

2161  aagctgggat  aggaattagg  atcaggttgt  ggtttggact  gctcatcaga  agagctgtct

2221  cggggctgga  gagatggctc  agcggttaag  agcgctgact  gttcttctga  aagtcctgag

2281  ttcaaatccc  agcaaccaca  tggtggctca  taaccatctg  taataagatc  tgacaccctc

2341  ttctggtgtc  tgaagacagc  tacagtgtac  ttacatataa  taataaataa  atctt

GenBank Submission AF127657

Submission 4 of a set of 9 submission(s).

Comment:  This  cDNA  sequence  is  the  mutant  form.  A pt  mutation  in  intron  alpha 
causes  a loss  of  exon  1,  which  contains  the  start  codon. 
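
The lengths of the wild-type and mutant mRNA records can be reconciled from the exon coordinates alone. The sketch below is illustrative Python, not part of the original submission: it uses the `join(...)` coordinates and CDS start listed in the wild-type mRNA record (AF127656), finds the exon that contains the start codon, and subtracts its length to give the length expected for a transcript that skips that exon. The names `EXONS`, `CDS_START`, and `length` are hypothetical.

```python
# Exon coordinates (1-based, inclusive) from the mRNA join(...) feature of
# the wild-type mRNA record (AF127656).
EXONS = [
    (1, 124), (125, 181), (182, 292), (293, 419), (420, 605), (606, 809),
    (810, 891), (892, 979), (980, 1078), (1079, 1185), (1186, 1239),
    (1240, 1326), (1327, 1416), (1417, 1532), (1533, 1647), (1648, 1733),
    (1734, 2395),
]
CDS_START = 179  # first base of the CDS feature in AF127656


def length(segment):
    """Length of a 1-based, inclusive (start, end) segment."""
    start, end = segment
    return end - start + 1


wild_type_length = sum(length(e) for e in EXONS)

# Exon 1 is the segment that contains the start codon.
exon1 = next(e for e in EXONS if e[0] <= CDS_START <= e[1])
mutant_length = wild_type_length - length(exon1)

print(wild_type_length, exon1, length(exon1), mutant_length)
```

Under these coordinates the exon carrying the start codon is 57 bp long, so skipping it shortens the 2395 bp wild-type mRNA to 2338 bp, which is the length given in the LOCUS line of this record.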


LOCUS       XXXXX  2338 bp  mRNA  ROD  11-FEB-1999
DEFINITION  Mus musculus (with mutation) RAB geranylgeranyl transferase, alpha
            subunit (Rabggta) mRNA, complete cds (mutation in intron alpha
            causes loss of exon 1 which contains START).
ACCESSION
KEYWORDS
SOURCE      house mouse.
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria;
            Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE   1 (bases 1 to 2338)
  AUTHORS   Detter,J.C., Novack,E.K., Seabra,M, Swank,R.T. and Kingsmore,S.F.
  TITLE     Abnormal Protein Prenylation Leads to Hypopigmentation/Platelet
            Storage Pool Deficiency in the Gunmetal Mouse Mutant
  JOURNAL   Unpublished
REFERENCE   2 (bases 1 to 2338)
  AUTHORS   Detter,J.C.
  TITLE     Direct Submission
  JOURNAL   Submitted (11-FEB-1999) Molecular Genetics & Microbiology, and
            Medicine, University of Florida, 1600 SW Archer Rd, Gainesville, FL
            32610, USA
FEATURES             Location/Qualifiers
     source          1..2338
                     /organism="Mus musculus"
                     /strain="C57BL/6J-gm/gm"
                     /db_xref="taxon:10090"
                     /chromosome="14"
                     /map="between Mit63 and Mit122"

BASE  COUNT  490  a 649  c 665  g 534  t 

ORIGIN 

1 gtgcaagggt  ccacgggacg  gacggacggg  cgggcgggca  ctgctctcta  tatgcgcggc 

61  gtgtgcctcg  gccctgacgg  gtggggcttc  ggaaagatcc  gtggggacgg  aggccgtggt 

121  tgtgcatggg  cgcctgaagg  tgaagacgtc  cgaagagcag  gcagaggcca  aaaggctaga 

181  gcgggaacag  aagctaaagc  tctaccagtc  agccactcaa  gctgtcttcc  agaagcggga 

241  ggcaggcgag  ctggatgaat  ccgtcctaga  actgacaagc  cagattctgg  gagccaaccc 

301  tgattttgcc  accctctgga  actgtcgcag  agaagtgctc  cagcagctag  aaacccagaa 

361  gtcccctgag  gagttggctg  ctcttgtgaa  ggcagaacta  ggcttccttg  agagctgtct 

421  gcgtgtgaac  cctaagtcct  atggcacttg  gcaccaccgc  tgctggctgc  tgagtcgcct 

481  gcctgagccc  aactgggccc  gggagctgga  gctgtgcgct  cgcttcctcg  aggccgatga 

541  gcggaacttt  cattgctggg  actatcggcg  atttgtagct  gcacaggctg  ctgtcgcgcc 

601  tgcagaggaa  ctagccttca  ctgacagcct  catcacccgg  aacttctcca  actattcttc 

661  ctggcattat  cgctcctgcc  tcttgcccca  gctgcacccc  cagccagact  ctggccccca 

721  gggacggctc  cctgaaaatg  tactgctgag  agagctggag  ttggtgcaga  atgccttctt 

781  tactgacccc  aatgatcaga  gtgcctggtt  ctatcaccgc  tggctcctgg  ggcgggcgga 

841  gccccacgat  gttctgtgct  gcctgcatgt  gagccgggaa  gaggcctgtc  tgtcagtctg 

901  cttttctcgt  cccctgatag  tgggctccaa  gatggggacc  ttgctactca  cggttgacga 

961  ggcacctttg  agtgtggaat  ggaggactcc  agatggcagg  aaccggccca  gccatgtctg 

1021  gctctgtgac  ctgcctgctg  cctctctcaa  tgaccacttg  ccccagcaca  cgtttcgtgt 

1081  catctggaca  ggaagcgata  cccagaagga  gtgtgtgctt  ttaaaaggtc  accaggagtg 

1141  ctggtgccga  gactcagcca  ctgatgaaca  gttgttcagg  tgtgagctgt  ccgtggagaa 

1201  gtccacagtg  ctacagtccg  agcttgagtc  ctgtaaggag  ctgcaggaac  tggagcctga 

1261  gaataaatgg  tgcctgctga  ccatcatcct  cctgatgagg  gcactggacc  ccctcctcta 

1321  cgagaaagaa  acactggagt  acttcagtac  cctcaaagct  gtggacccga  tgagagcagc 

1381  ctacttggac  gacctgcgca  gcaagttctt  ggtggaaaac  agtgttctca  agatggagta 

1441  tgctgacgtg  cgtgtgctgc  acctggctca  caaggatctc  acagtgctct  gccatttgga 

1501  acaactgctc  ttggtcactc  atcttgacct  gtcccataat  cgtctccgag  ccttgccccc 

1561  agccctggct  gctctgcgct  gtctcgaggt  gttgcaggcc  agtgataatg  tcctggagaa 


1621  cttggacggc  gtggctaacc  ttccccggct  gcgggagctg  ttactatgta  ataaccgcct

1681  ccagcagtct  gctgcacttc  agactcttgc  ctcctgcccc  aggctagtct  tcctcaacct

1741  gcagggcaac  tcgctgtgcc  aagaagaggg  catccgggag  cgtttggctg  agatgctacc

1801  gtccgtcagc  agcatcctca  cctaggaggc  cccatctcac  cctcaccctt  taacttattg

1861  ggactgaata  aagactggag  aggcccctct  aggctgccaa  gctaccatta  ctgctgccct

1921  gacaacactt  cctccattac  caccactgtg  cattttggaa  aaaggaaggg  aggggctctc

1981  tggggtatgg  ggccactgtc  ctgtgattct  gtctaccaag  accctcagtg  tttatggctg

2041  gtcaggacca  gggctgggtc  tgagttcacc  atcagttttg  ggctatcgtg  gtgcctgagc

2101  ttgaagctgg  gataggaatt  aggatcaggt  tgtggtttgg  actgctcatc  agaagagctg

2161  tctcggggct  ggagagatgg  ctcagcggtt  aagagcgctg  actgttcttc  tgaaagtcct

2221  gagttcaaat  cccagcaacc  acatggtggc  tcataaccat  ctgtaataag  atctgacacc

2281  ctcttctggt  gtctgaagac  agctacagtg  tacttacata  taataataaa  taaatctt

GenBank Submission AF127658

Submission 5 of a set of 9 submission(s).

Comment: This cDNA sequence contains Intron alpha. bp 276 contains the WT
nucleotide.
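
Because this transcript retains intron alpha, its CDS is annotated as starting at bp 331 rather than bp 179. The following Python sketch, offered as an illustration rather than as part of the submission, translates the start of the reading frame from the row covering bp 301-360 of this record's ORIGIN section and shows that it reproduces the first residues of the annotated translation. The codon table is the standard genetic code written in the compact t-c-a-g ordering; `translate` is a hypothetical helper.

```python
# Standard genetic code, codons enumerated in t, c, a, g order.
BASES = "tcag"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Row covering bp 301-360 of the intron-alpha-containing mRNA (AF127658).
ROW_START = 301
ROW = "ctcccctatc gccaacgctg agccgaaacc atgcatgggc gcctgaaggt gaagacgtcc"
CDS_START = 331  # from the CDS feature of AF127658


def translate(dna):
    """Translate complete codons of a lowercase DNA string, in frame 1."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


sequence = ROW.replace(" ", "")
peptide = translate(sequence[CDS_START - ROW_START:])
print(peptide)
```

The ten residues obtained this way match the beginning of the /translation qualifier (MHGRLKVKTS...), which is what one would expect if retaining intron alpha shifts the CDS coordinates without altering the encoded protein.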


LOCUS       XXXXX  2547 bp  mRNA  ROD  11-FEB-1999
DEFINITION  Mus musculus (with alternative splicing) RAB geranylgeranyl
            transferase, alpha subunit (Rabggta) mRNA, complete cds
            (alternative splicing adds Intron alpha).
ACCESSION
KEYWORDS
SOURCE      house mouse.
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria;
            Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE   1 (bases 1 to 2547)
  AUTHORS   Detter,J.C., Novack,E.K., Seabra,M, Swank,R.T. and Kingsmore,S.F.
  TITLE     Abnormal Protein Prenylation Leads to Hypopigmentation/Platelet
            Storage Pool Deficiency in the Gunmetal Mouse Mutant
  JOURNAL   Unpublished
REFERENCE   2 (bases 1 to 2547)
  AUTHORS   Detter,J.C.
  TITLE     Direct Submission
  JOURNAL   Submitted (11-FEB-1999) Molecular Genetics & Microbiology, and
            Medicine, University of Florida, 1600 SW Archer Rd, Gainesville, FL
            32610, USA
FEATURES             Location/Qualifiers
     source          1..2547
                     /organism="Mus musculus"
                     /strain="C57BL/6J"
                     /db_xref="taxon:10090"
                     /chromosome="14"
                     /map="between Mit63 and Mit122"
     gene            1..2547
                     /gene="Rabggta"
     CDS             331..2034
                     /gene="Rabggta"
                     /codon_start=1
                     /product="RABGGTA"
                     /translation="MHGRLKVKTSEEQAEAKRLEREQKLKLYQSATQAVFQKREAGEL
                     DESVLELTSQILGANPDFATLWNCRREVLQQLETQKSPEELAALVKAELGFLESCLRV
                     NPKSYGTWHHRCWLLSRLPEPNWARELELCARFLEADERNFHCWDYRRFVAAQAAVAP
                     AEELAFTDSLITRNFSNYSSWHYRSCLLPQLHPQPDSGPQGRLPENVLLRELELVQNA
                     FFTDPNDQSAWFYHRWLLGRAEPHDVLCCLHVSREEACLSVCFSRPLIVGSKMGTLLL
                     TVDEAPLSVEWRTPDGRNRPSHVWLCDLPAASLNDHLPQHTFRVIWTGSDTQKECVLL
                     KGHQECWCRDSATDEQLFRCELSVEKSTVLQSELESCKELQELEPENKWCLLTIILLM
                     RALDPLLYEKETLEYFSTLKAVDPMRAAYLDDLRSKFLVENSVLKMEYADVRVLHLAH
                     KDLTVLCHLEQLLLVTHLDLSHNRLRALPPALAALRCLEVLQASDNVLENLDGVANLP
                     RLRELLLCNNRLQQSAALQTLASCPRLVFLNLQGNSLCQEEGIRERLAEMLPSVSSIL
                     T"
BASE COUNT  521 a 720 c 728 g 578 t

ORIGIN 

1 gtgcaagggt  ccacgggacg  gacggacggg  cgggcgggca  ctgctctcta  tatgcgcggc 

61  gtgtgcctcg  gccctgacgg  gtggggcttc  ggaaagatcc  gtggggacgg  aggccgtggt 

121  tgtggtgagc  gcttcctcct  gacctcacag  tgccagccgc  cggccccctt  ttaactcacg 

181  gggatcctta  tcctgggcct  gctggcagct  tccgggaaac  acctgggggg  cggggccgtg 

241  tgggcggggt  tctgtgattg  acatctcttt  cggcagattg  aacgggctgc  ctccggaccc 

301  ctcccctatc  gccaacgctg  agccgaaacc  atgcatgggc  gcctgaaggt  gaagacgtcc 

361  gaagagcagg  cagaggccaa  aaggctagag  cgggaacaga  agctaaagct  ctaccagtca 

421  gccactcaag  ctgtcttcca  gaagcgggag  gcaggcgagc  tggatgaatc  cgtcctagaa 

481  ctgacaagcc  agattctggg  agccaaccct  gattttgcca  ccctctggaa  ctgtcgcaga 

541  gaagtgctcc  agcagctaga  aacccagaag  tcccctgagg  agttggctgc  tcttgtgaag 


601  gcagaactag  gcttccttga  gagctgtctg  cgtgtgaacc  ctaagtccta  tggcacttgg

661  caccaccgct  gctggctgct  gagtcgcctg  cctgagccca  actgggcccg  ggagctggag

721  ctgtgcgctc  gcttcctcga  ggccgatgag  cggaactttc  attgctggga  ctatcggcga

781  tttgtagctg  cacaggctgc  tgtcgcgcct  gcagaggaac  tagccttcac  tgacagcctc

841  atcacccgga  acttctccaa  ctattcttcc  tggcattatc  gctcctgcct  cttgccccag

901  ctgcaccccc  agccagactc  tggcccccag  ggacggctcc  ctgaaaatgt  actgctgaga

961  gagctggagt  tggtgcagaa  tgccttcttt  actgacccca  atgatcagag  tgcctggttc

1021  tatcaccgct  ggctcctggg  gcgggcggag  ccccacgatg  ttctgtgctg  cctgcatgtg

1081  agccgggaag  aggcctgtct  gtcagtctgc  ttttctcgtc  ccctgatagt  gggctccaag

1141  atggggacct  tgctactcac  ggttgacgag  gcacctttga  gtgtggaatg  gaggactcca

1201  gatggcagga  accggcccag  ccatgtctgg  ctctgtgacc  tgcctgctgc  ctctctcaat

1261  gaccacttgc  cccagcacac  gtttcgtgtc  atctggacag  gaagcgatac  ccagaaggag

1321  tgtgtgcttt  taaaaggtca  ccaggagtgc  tggtgccgag  actcagccac  tgatgaacag

1381  ttgttcaggt  gtgagctgtc  cgtggagaag  tccacagtgc  tacagtccga  gcttgagtcc

1441  tgtaaggagc  tgcaggaact  ggagcctgag  aataaatggt  gcctgctgac  catcatcctc

1501  ctgatgaggg  cactggaccc  cctcctctac  gagaaagaaa  cactggagta  cttcagtacc

1561  ctcaaagctg  tggacccgat  gagagcagcc  tacttggacg  acctgcgcag  caagttcttg

1621  gtggaaaaca  gtgttctcaa  gatggagtat  gctgacgtgc  gtgtgctgca  cctggctcac

1681  aaggatctca  cagtgctctg  ccatttggaa  caactgctct  tggtcactca  tcttgacctg

1741  tcccataatc  gtctccgagc  cttgccccca  gccctggctg  ctctgcgctg  tctcgaggtg

1801  ttgcaggcca  gtgataatgt  cctggagaac  ttggacggcg  tggctaacct  tccccggctg

1861  cgggagctgt  tactatgtaa  taaccgcctc  cagcagtctg  ctgcacttca  gactcttgcc

1921  tcctgcccca  ggctagtctt  cctcaacctg  cagggcaact  cgctgtgcca  agaagagggc

1981  atccgggagc  gtttggctga  gatgctaccg  tccgtcagca  gcatcctcac  ctaggaggcc

2041  ccatctcacc  ctcacccttt  aacttattgg  gactgaataa  agactggaga  ggcccctcta

2101  ggctgccaag  ctaccattac  tgctgccctg  acaacacttc  ctccattacc  accactgtgc

2161  attttggaaa  aaggaaggga  ggggctctct  ggggtatggg  gccactgtcc  tgtgattctg

2221  tctaccaaga  ccctcagtgt  ttatggctgg  tcaggaccag  ggctgggtct  gagttcacca

2281  tcagttttgg  gctatcgtgg  tgcctgagct  tgaagctggg  ataggaatta  ggatcaggtt

2341  gtggtttgga  ctgctcatca  gaagagctgt  ctcggggctg  gagagatggc  tcagcggtta

2401  agagcgctga  ctgttcttct  gaaagtcctg  agttcaaatc  ccagcaacca  catggtggct

2461  cataaccatc  tgtaataaga  tctgacaccc  tcttctggtg  tctgaagaca  gctacagtgt

2521  acttacatat  aataataaat  aaatctt

GenBank Submission AF127659

Submission 6 of a set of 9 submission(s).

Comment: This cDNA sequence contains Intron alpha, which contains the pt
mutation, due to alternative splicing. bp 276 contains the mutant nucleotide.


LOCUS 

DEFINITION 

XXXXX  2547  bp  mRNA  ROD  ll-FEB-1999 

Mus  musculus  (with  alternative  splicing  and  pt  mutation)  RAB 
geranylgeranyl  transferase,  alpha  subunit  (Rabggta)  mRNA,  complete 
cds  (alternative  splicing  adds  Intron  alpha  which  contains  the  pt 
mutation) . 

ACCESSION 

KEYWORDS 

SOURCE 

ORGANISM 

• 

• 

house  mouse. 
Mus  musculus 

REFERENCE 

AUTHORS 

TITLE 

Eukaryota;  Metazoa;  Chordata;  Vertebrata;  Mammalia;  Eutheria; 
Rodent ia;  Sciurognathi ; Muridae;  Murinae;  Mus. 

1 (bases  1 to  2547) 

Better, J.C.,  Novack,E.K.,  Seabra,M,  Swank, R.T.  and  Kingsmore, S . F. 
Abnormal  Protein  Prenylation  Leads  to  Hypopigmentation/Platelet 
Storage  Pool  Deficiency  in  the  Gunmetal  Mouse  Mutant 

JOURNAL 

REFERENCE 

AUTHORS 

TITLE 

JOURNAL 

Unpublished 
2 (bases  1 to  2547) 

Better, J. C . 

Direct  Submission 

Submitted  (ll-FEB-1999)  Molecular  Genetics  & Microbiology,  and 
Medicine,  University  of  Florida,  1600  SW  Archer  RD,  Gainesville, FL 
32610,  USA 

FEATURES 

Location/Qualifiers 

source  1 . . 2547 

/organism="Mus  musculus" 


gene 

/strain="C57BL/6 J-gm/gm" 

/db  xref="taxon : 10090" 
/chromosome="14 " 

/map="between  Mit63  and  Mitl22" 

1. .2547 

/gene="Rabggta  (with  pt  mutation) " 

CDS 

331. .2034 

/gene="Rabggta  (with  pt  mutation) " 

/codon  start=l 
/product="RABGGTA" 

/translation="MHGRLKVKTSEEQAEAKRLEREQKLKLYQSATQAVFQKREAGEL 

DESVLELTSQILGANPDFATLWNCRREVLQQLETQKSPEELAALVKAELGFLESCLRV 

NPKSYGTWHHRCWLLSRLPEPNWARELELCARFLEADERNFHCWDYRRFVAAQAAVAP 

AEELAFTDSLITRNFSNYSSWHYRSCLLPQLHPQPDSGPQGRLPENVLLRELELVQNA 

FFTDPNDQSAWFYHRWLLGRAEPHDVLCCLHVSREEACLSVCFSRPLIVGSKMGTLLL 

TVDEAPLSVEWRTPDGRNRPSHVWLCDLPAASLNDHLPQHTFRVIWTGSDTQKECVLL 

KGHQECWCRDSATDEQLFRCELSVEKSTVLQSELESCKELQELEPENKWCLLTIILLM 

RALDPLLYEKETLEYFSTLKAVDPMRAAYLDDLRSKFLVENSVLKMEYADVRVLHLAH 

KDLTVLCHLEQLLLVTHLDLSHNRLRALPPALAALRCLEVLQASDNVLENLDGVANLP 

RLRELLLCNNRLQQSAALQTLASCPRLVFLNLQGNSLCQEEGIRERLAEMLPSVSSIL 

ip  II 

BASE  COUNT 
ORIGIN 

522  a 720  c 727  g 578  t 

1 gtgcaagggt  ccacgggacg  gacggacggg  cgggcgggca  ctgctctcta  tatgcgcggc 

61  gtgtgcctcg  gccctgacgg  gtggggcttc  ggaaagatcc  gtggggacgg  aggccgtggt 

121  tgtggtgagc  gcttcctcct  gacctcacag  tgccagccgc  cggccccctt  ttaactcacg 

181  gggatcctta  tcctgggcct  gctggcagct  tccgggaaac  acctgggggg  cggggccgtg 

241  tgggcggggt  tctgtgattg  acatctcttt  cggcaaattg  aacgggctgc  ctccggaccc 

301  ctcccctatc  gccaacgctg  agccgaaacc  atgcatgggc  gcctgaaggt  gaagacgtcc 

361  gaagagcagg  cagaggccaa  aaggctagag  cgggaacaga  agctaaagct  ctaccagtca 

421  gccactcaag  ctgtcttcca  gaagcgggag  gcaggcgagc  tggatgaatc  cgtcctagaa 

481  ctgacaagcc  agattctggg  agccaaccct  gattttgcca  ccctctggaa  ctgtcgcaga 


541 gaagtgctcc agcagctaga aacccagaag tcccctgagg agttggctgc tcttgtgaag
601 gcagaactag gcttccttga gagctgtctg cgtgtgaacc ctaagtccta tggcacttgg
661 caccaccgct gctggctgct gagtcgcctg cctgagccca actgggcccg ggagctggag
721 ctgtgcgctc gcttcctcga ggccgatgag cggaactttc attgctggga ctatcggcga
781 tttgtagctg cacaggctgc tgtcgcgcct gcagaggaac tagccttcac tgacagcctc
841 atcacccgga acttctccaa ctattcttcc tggcattatc gctcctgcct cttgccccag
901 ctgcaccccc agccagactc tggcccccag ggacggctcc ctgaaaatgt actgctgaga
961 gagctggagt tggtgcagaa tgccttcttt actgacccca atgatcagag tgcctggttc
1021 tatcaccgct ggctcctggg gcgggcggag ccccacgatg ttctgtgctg cctgcatgtg
1081 agccgggaag aggcctgtct gtcagtctgc ttttctcgtc ccctgatagt gggctccaag
1141 atggggacct tgctactcac ggttgacgag gcacctttga gtgtggaatg gaggactcca
1201 gatggcagga accggcccag ccatgtctgg ctctgtgacc tgcctgctgc ctctctcaat
1261 gaccacttgc cccagcacac gtttcgtgtc atctggacag gaagcgatac ccagaaggag
1321 tgtgtgcttt taaaaggtca ccaggagtgc tggtgccgag actcagccac tgatgaacag
1381 ttgttcaggt gtgagctgtc cgtggagaag tccacagtgc tacagtccga gcttgagtcc
1441 tgtaaggagc tgcaggaact ggagcctgag aataaatggt gcctgctgac catcatcctc
1501 ctgatgaggg cactggaccc cctcctctac gagaaagaaa cactggagta cttcagtacc
1561 ctcaaagctg tggacccgat gagagcagcc tacttggacg acctgcgcag caagttcttg
1621 gtggaaaaca gtgttctcaa gatggagtat gctgacgtgc gtgtgctgca cctggctcac
1681 aaggatctca cagtgctctg ccatttggaa caactgctct tggtcactca tcttgacctg
1741 tcccataatc gtctccgagc cttgccccca gccctggctg ctctgcgctg tctcgaggtg
1801 ttgcaggcca gtgataatgt cctggagaac ttggacggcg tggctaacct tccccggctg
1861 cgggagctgt tactatgtaa taaccgcctc cagcagtctg ctgcacttca gactcttgcc
1921 tcctgcccca ggctagtctt cctcaacctg cagggcaact cgctgtgcca agaagagggc
1981 atccgggagc gtttggctga gatgctaccg tccgtcagca gcatcctcac ctaggaggcc
2041 ccatctcacc ctcacccttt aacttattgg gactgaataa agactggaga ggcccctcta
2101 ggctgccaag ctaccattac tgctgccctg acaacacttc ctccattacc accactgtgc
2161 attttggaaa aaggaaggga ggggctctct ggggtatggg gccactgtcc tgtgattctg
2221 tctaccaaga ccctcagtgt ttatggctgg tcaggaccag ggctgggtct gagttcacca
2281 tcagttttgg gctatcgtgg tgcctgagct tgaagctggg ataggaatta ggatcaggtt
2341 gtggtttgga ctgctcatca gaagagctgt ctcggggctg gagagatggc tcagcggtta
2401 agagcgctga ctgttcttct gaaagtcctg agttcaaatc ccagcaacca catggtggct
2461 cataaccatc tgtaataaga tctgacaccc tcttctggtg tctgaagaca gctacagtgt
2521 acttacatat aataataaat aaatctt
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GenBank  Submission  AF127660 


Submission 7 of a set of 9 submission(s).

Comment:  This  cDNA  sequence  contains  Intron  9 (bp  1186-1282)  due  to  alternative 
splicing.  This  adds  a new  STOP. 
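The premature stop introduced by the retained intron can be checked by translating across the exon 9/intron 9 junction. The following is a minimal sketch rather than part of the submission. The 42-base fragment is copied from the junction as printed in rows 1081 and 1141 of the AF127661 ORIGIN block later in this appendix, which carries the same retained intron; the reading frame is an assumption taken from the end of the translation listed for this record (...ECVLLKGDAV); and the names `CODON_TABLE` and `translate_to_stop` are hypothetical helpers introduced only for illustration.

```python
from itertools import product
from typing import Optional, Tuple

# Standard genetic code (NCBI translation table 1), codons enumerated
# with bases in t, c, a, g order and the third position cycling fastest.
BASES = "tcag"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_to_stop(seq: str) -> Tuple[str, Optional[int]]:
    """Translate seq in frame 0 and stop at the first stop codon.

    Returns the peptide and the 0-based index of the stop codon,
    or None for the index if no in-frame stop is found.
    """
    seq = seq.lower()
    peptide = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            return "".join(peptide), i
        peptide.append(amino_acid)
    return "".join(peptide), None


# Assumed in-frame fragment spanning the exon 9 / intron 9 junction.
JUNCTION = "gagtgtgtgcttttaaaaggtgatgctgtctaaagctcccct"

peptide, stop_index = translate_to_stop(JUNCTION)
print(peptide)     # ECVLLKGDAV
print(stop_index)  # 30
```

Under these assumptions the peptide ends in KGDAV, matching the last residues of the translation listed in the CDS feature, and the stop codon is the `taa` that falls inside the retained intron.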


LOCUS       XXXXX 2492 bp mRNA ROD 11-FEB-1999
DEFINITION  Mus musculus (with alternative splicing) RAB geranylgeranyl
            transferase, alpha subunit (Rabggta) mRNA, complete cds
            (alternative splicing adds Intron 9 which contains new STOP
            location).
ACCESSION
KEYWORDS
SOURCE      house mouse.
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria;
            Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE   1 (bases 1 to 2492)
  AUTHORS   Detter,J.C., Novack,E.K., Seabra,M, Swank,R.T. and Kingsmore,S.F.
  TITLE     Abnormal Protein Prenylation Leads to Hypopigmentation/Platelet
            Storage Pool Deficiency in the Gunmetal Mouse Mutant
  JOURNAL   Unpublished
REFERENCE   2 (bases 1 to 2492)
  AUTHORS   Detter,J.C.
  TITLE     Direct Submission
  JOURNAL   Submitted (11-FEB-1999) Molecular Genetics & Microbiology, and
            Medicine, University of Florida, 1600 SW Archer RD, Gainesville, FL
            32610, USA
FEATURES             Location/Qualifiers
     source          1..2492
                     /organism="Mus musculus"
                     /strain="C57BL/6J"
                     /db_xref="taxon:10090"
                     /chromosome="14"
                     /map="between Mit63 and Mit122"
     gene            1..2492
                     /gene="Rabggta"
     CDS             179..1198
                     /gene="Rabggta"
                     /codon_start=1
                     /product="RABGGTA (alternative)"
                     /translation="MHGRLKVKTSEEQAEAKRLEREQKLKLYQSATQAVFQKREAGEL
                     DESVLELTSQILGANPDFATLWNCRREVLQQLETQKSPEELAALVKAELGFLESCLRV
                     NPKSYGTWHHRCWLLSRLPEPNWARELELCARFLEADERNFHCWDYRRFVAAQAAVAP
                     AEELAFTDSLITRNFSNYSSWHYRSCLLPQLHPQPDSGPQGRLPENVLLRELELVQNA
                     FFTDPNDQSAWFYHRWLLGRAEPHDVLCCLHVSREEACLSVCFSRPLIVGSKMGTLLL
                     TVDEAPLSVEWRTPDGRNRPSHVWLCDLPAASLNDHLPQHTFRVIWTGSDTQKECVLL
                     KGDAV"
BASE COUNT  517 a 716 c 690 g 569 t
ORIGIN
1 gtgcaagggt ccacgggacg gacggacggg cgggcgggca ctgctctcta tatgcgcggc
61 gtgtgcctcg gccctgacgg gtggggcttc ggaaagatcc gtggggacgg aggccgtggt
121 tgtgattgaa cgggctgcct ccggacccct cccctatcgc caacgctgag ccgaaaccat
181 gcatgggcgc ctgaaggtga agacgtccga agagcaggca gaggccaaaa ggctagagcg
241 ggaacagaag ctaaagctct accagtcagc cactcaagct gtcttccaga agcgggaggc
301 aggcgagctg gatgaatccg tcctagaact gacaagccag attctgggag ccaaccctga
361 ttttgccacc ctctggaact gtcgcagaga agtgctccag cagctagaaa cccagaagtc
421 ccctgaggag ttggctgctc ttgtgaaggc agaactaggc ttccttgaga gctgtctgcg
481 tgtgaaccct aagtcctatg gcacttggca ccaccgctgc tggctgctga gtcgcctgcc
541 tgagcccaac tgggcccggg agctggagct gtgcgctcgc ttcctcgagg ccgatgagcg
601 gaactttcat tgctgggact atcggcgatt tgtagctgca caggctgctg tcgcgcctgc
661 agaggaacta gccttcactg acagcctcat cacccggaac ttctccaact attcttcctg
721 gcattatcgc tcctgcctct tgccccagct gcacccccag ccagactctg gcccccaggg


781 acggctccct gaaaatgtac tgctgagaga gctggagttg gtgcagaatg ccttctttac
841 tgaccccaat gatcagagtg cctggttcta tcaccgctgg ctcctggggc gggcggagcc
901 ccacgatgtt ctgtgctgcc tgcatgtgag ccgggaagag gcctgtctgt cagtctgctt
961 ttctcgtccc ctgatagtgg gctccaagat ggggaccttg ctactcacgg ttgacgaggc
1021 acctttgagt gtggaatgga ggactccaga tggcaggaac cggcccagcc atgtctggct
1081 ctgtgacctg cctgctgcct ctctcaatga ccacttgccc cagcacacgt ttcgtgtcat
1141 ctggacagga agcgataccc agaaggagtg tgtgctttta aaaggtgatg ctgtctaaag
1201 ctccccttgc ctcgccagcc ccccagcctt ctctctcctc tccccacagt gtcccatcca
1261 actacttaac ctttccccta ggtcaccagg agtgctggtg ccgagactca gccactgatg
1321 aacagttgtt caggtgtgag ctgtccgtgg agaagtccac agtgctacag tccgagcttg
1381 agtcctgtaa ggagctgcag gaactggagc ctgagaataa atggtgcctg ctgaccatca
1441 tcctcctgat gagggcactg gaccccctcc tctacgagaa agaaacactg gagtacttca
1501 gtaccctcaa agctgtggac ccgatgagag cagcctactt ggacgacctg cgcagcaagt
1561 tcttggtgga aaacagtgtt ctcaagatgg agtatgctga cgtgcgtgtg ctgcacctgg
1621 ctcacaagga tctcacagtg ctctgccatt tggaacaact gctcttggtc actcatcttg
1681 acctgtccca taatcgtctc cgagccttgc ccccagccct ggctgctctg cgctgtctcg
1741 aggtgttgca ggccagtgat aatgtcctgg agaacttgga cggcgtggct aaccttcccc
1801 ggctgcggga gctgttacta tgtaataacc gcctccagca gtctgctgca cttcagactc
1861 ttgcctcctg ccccaggcta gtcttcctca acctgcaggg caactcgctg tgccaagaag
1921 agggcatccg ggagcgtttg gctgagatgc taccgtccgt cagcagcatc ctcacctagg
1981 aggccccatc tcaccctcac cctttaactt attgggactg aataaagact ggagaggccc
2041 ctctaggctg ccaagctacc attactgctg ccctgacaac acttcctcca ttaccaccac
2101 tgtgcatttt ggaaaaagga agggaggggc tctctggggt atggggccac tgtcctgtga
2161 ttctgtctac caagaccctc agtgtttatg gctggtcagg accagggctg ggtctgagtt
2221 caccatcagt tttgggctat cgtggtgcct gagcttgaag ctgggatagg aattaggatc
2281 aggttgtggt ttggactgct catcagaaga gctgtctcgg ggctggagag atggctcagc
2341 ggttaagagc gctgactgtt cttctgaaag tcctgagttc aaatcccagc aaccacatgg
2401 tggctcataa ccatctgtaa taagatctga caccctcttc tggtgtctga agacagctac
2461 agtgtactta catataataa taaataaatc tt
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GenBank  Submission  AF127661 


Submission 8 of a set of 9 submission(s).


Comment:  This  cDNA  sequence  has  lost  exon  1,  which  contains  START,  due  to 
mutation  in  intron  alpha  and  also  contains  Intron  9 (bp  1129-1225)  due  to 
alternative  splicing.  This  adds  a new  STOP. 


LOCUS       XXXXX 2435 bp mRNA ROD 11-FEB-1999
DEFINITION  Mus musculus (with mutation & alternative splicing) RAB
            geranylgeranyl transferase, alpha subunit (Rabggta) mRNA, complete
            cds (Mutation causes loss of exon 1 and START) (alternative
            splicing adds Intron 9 which contains new STOP location).
ACCESSION
KEYWORDS
SOURCE      house mouse.
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria;
            Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE   1 (bases 1 to 2435)
  AUTHORS   Detter,J.C., Novack,E.K., Seabra,M, Swank,R.T. and Kingsmore,S.F.
  TITLE     Abnormal Protein Prenylation Leads to Hypopigmentation/Platelet
            Storage Pool Deficiency in the Gunmetal Mouse Mutant
  JOURNAL   Unpublished
REFERENCE   2 (bases 1 to 2435)
  AUTHORS   Detter,J.C.
  TITLE     Direct Submission
  JOURNAL   Submitted (11-FEB-1999) Molecular Genetics & Microbiology, and
            Medicine, University of Florida, 1600 SW Archer RD, Gainesville, FL
            32610, USA
FEATURES             Location/Qualifiers
     source          1..2435
                     /organism="Mus musculus"
                     /strain="C57BL/6J-gm/gm"
                     /db_xref="taxon:10090"
                     /chromosome="14"
                     /map="between Mit63 and Mit122"

BASE  COUNT  505  a 693  c 677  g 560  t 

ORIGIN 

1 gtgcaagggt  ccacgggacg  gacggacggg  cgggcgggca  ctgctctcta  tatgcgcggc 

61  gtgtgcctcg  gccctgacgg  gtggggcttc  ggaaagatcc  gtggggacgg  aggccgtggt 

121  tgtgcatggg  cgcctgaagg  tgaagacgtc  cgaagagcag  gcagaggcca  aaaggctaga 

181  gcgggaacag  aagctaaagc  tctaccagtc  agccactcaa  gctgtcttcc  agaagcggga 

241  ggcaggcgag  ctggatgaat  ccgtcctaga  actgacaagc  cagattctgg  gagccaaccc 

301  tgattttgcc  accctctgga  actgtcgcag  agaagtgctc  cagcagctag  aaacccagaa 

361  gtcccctgag  gagttggctg  ctcttgtgaa  ggcagaacta  ggcttccttg  agagctgtct 

421  gcgtgtgaac  cctaagtcct  atggcacttg  gcaccaccgc  tgctggctgc  tgagtcgcct 

481  gcctgagccc  aactgggccc  gggagctgga  gctgtgcgct  cgcttcctcg  aggccgatga 

541  gcggaacttt  cattgctggg  actatcggcg  atttgtagct  gcacaggctg  ctgtcgcgcc 

601  tgcagaggaa  ctagccttca  ctgacagcct  catcacccgg  aacttctcca  actattcttc 

661  ctggcattat  cgctcctgcc  tcttgcccca  gctgcacccc  cagccagact  ctggccccca 

721  gggacggctc  cctgaaaatg  tactgctgag  agagctggag  ttggtgcaga  atgccttctt 

781  tactgacccc  aatgatcaga  gtgcctggtt  ctatcaccgc  tggctcctgg  ggcgggcgga 

841  gccccacgat  gttctgtgct  gcctgcatgt  gagccgggaa  gaggcctgtc  tgtcagtctg 

901  cttttctcgt  cccctgatag  tgggctccaa  gatggggacc  ttgctactca  cggttgacga 

961  ggcacctttg  agtgtggaat  ggaggactcc  agatggcagg  aaccggccca  gccatgtctg 

1021  gctctgtgac  ctgcctgctg  cctctctcaa  tgaccacttg  ccccagcaca  cgtttcgtgt 

1081  catctggaca  ggaagcgata  cccagaagga  gtgtgtgctt  ttaaaaggtg  atgctgtcta 

1141  aagctcccct  tgcctcgcca  gccccccagc  cttctctctc  ctctccccac  agtgtcccat 

1201  ccaactactt  aacctttccc  ctaggtcacc  aggagtgctg  gtgccgagac  tcagccactg 

1261  atgaacagtt  gttcaggtgt  gagctgtccg  tggagaagtc  cacagtgcta  cagtccgagc 

1321  ttgagtcctg  taaggagctg  caggaactgg  agcctgagaa  taaatggtgc  ctgctgacca 

1381  tcatcctcct  gatgagggca  ctggaccccc  tcctctacga  gaaagaaaca  ctggagtact 

1441  tcagtaccct  caaagctgtg  gacccgatga  gagcagccta  cttggacgac  ctgcgcagca 


1501 agttcttggt ggaaaacagt gttctcaaga tggagtatgc tgacgtgcgt gtgctgcacc
1561 tggctcacaa ggatctcaca gtgctctgcc atttggaaca actgctcttg gtcactcatc
1621 ttgacctgtc ccataatcgt ctccgagcct tgcccccagc cctggctgct ctgcgctgtc
1681 tcgaggtgtt gcaggccagt gataatgtcc tggagaactt ggacggcgtg gctaaccttc
1741 cccggctgcg ggagctgtta ctatgtaata accgcctcca gcagtctgct gcacttcaga
1801 ctcttgcctc ctgccccagg ctagtcttcc tcaacctgca gggcaactcg ctgtgccaag
1861 aagagggcat ccgggagcgt ttggctgaga tgctaccgtc cgtcagcagc atcctcacct
1921 aggaggcccc atctcaccct caccctttaa cttattggga ctgaataaag actggagagg
1981 cccctctagg ctgccaagct accattactg ctgccctgac aacacttcct ccattaccac
2041 cactgtgcat tttggaaaaa ggaagggagg ggctctctgg ggtatggggc cactgtcctg
2101 tgattctgtc taccaagacc ctcagtgttt atggctggtc aggaccaggg ctgggtctga
2161 gttcaccatc agttttgggc tatcgtggtg cctgagcttg aagctgggat aggaattagg
2221 atcaggttgt ggtttggact gctcatcaga agagctgtct cggggctgga gagatggctc
2281 agcggttaag agcgctgact gttcttctga aagtcctgag ttcaaatccc agcaaccaca
2341 tggtggctca taaccatctg taataagatc tgacaccctc ttctggtgtc tgaagacagc
2401 tacagtgtac ttacatataa taataaataa atctt
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GenBank  Submission  AF127662 

Submission 9 of a set of 9 submission(s).

Comment:  This  cDNA  sequence  has  added  the  first  84  bp  of  Intron  alpha  and  lost 
the  first  13  bp  of  exon  1 due  to  cryptic  splicing.  It  should  give  the 
normal  protein  b/c  the  point  mutation  in  Intron  alpha  has  been  spliced 
out  and  the  start  remains.  The  new  spliced  exon  1 goes  from  bp  124-252. 
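The claim that the start codon survives cryptic splicing can be checked against the coordinates in this record. The sketch below is illustrative only: the 300 bases are copied from the first five printed rows of this record's ORIGIN block, the CDS coordinates 250..1953 are those listed in its feature table, and `feature_slice` is a hypothetical helper that applies the GenBank convention of 1-based, inclusive positions.

```python
def feature_slice(sequence: str, start: int, end: int) -> str:
    """Return the bases of a feature given 1-based, inclusive coordinates."""
    return sequence[start - 1:end]


# Rows 1, 61, 121, 181 and 241 of the AF127662 ORIGIN block, as printed.
ORIGIN_ROWS = [
    "gtgcaagggt ccacgggacg gacggacggg cgggcgggca ctgctctcta tatgcgcggc",
    "gtgtgcctcg gccctgacgg gtggggcttc ggaaagatcc gtggggacgg aggccgtggt",
    "tgtggtgagc gcttcctcct gacctcacag tgccagccgc cggccccctt ttaactcacg",
    "gggatcctta tcctgggcct gctggcagcc tccggacccc tcccctatcg ccaacgctga",
    "gccgaaacca tgcatgggcg cctgaaggtg aagacgtccg aagagcaggc agaggccaaa",
]
SEQUENCE = "".join(ORIGIN_ROWS).replace(" ", "")

CDS_START, CDS_END = 250, 1953  # from the CDS feature of this record

start_codon = feature_slice(SEQUENCE, CDS_START, CDS_START + 2)
cds_length = CDS_END - CDS_START + 1

print(start_codon)      # atg
print(cds_length)       # 1704
print(cds_length // 3)  # 568 codons: 567 residues plus the stop codon
```

The `atg` occupies bases 250 to 252, which agrees with the comment that the newly spliced exon 1 ends at bp 252, and the CDS length is a whole number of codons.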


LOCUS       XXXXX 2466 bp mRNA ROD 11-FEB-1999
DEFINITION  Mus musculus (with mutation & cryptic splicing) RAB geranylgeranyl
            transferase, alpha subunit (Rabggta) mRNA, complete cds (cryptic
            splicing adds the first 84bp of Intron alpha and removes the first
            13 bp of exon 1, START remains).
ACCESSION
KEYWORDS
SOURCE      house mouse.
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria;
            Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE   1 (bases 1 to 2466)
  AUTHORS   Detter,J.C., Novack,E.K., Seabra,M, Swank,R.T. and Kingsmore,S.F.
  TITLE     Abnormal Protein Prenylation Leads to Hypopigmentation/Platelet
            Storage Pool Deficiency in the Gunmetal Mouse Mutant
  JOURNAL   Unpublished
REFERENCE   2 (bases 1 to 2466)
  AUTHORS   Detter,J.C.
  TITLE     Direct Submission
  JOURNAL   Submitted (11-FEB-1999) Molecular Genetics & Microbiology, and
            Medicine, University of Florida, 1600 SW Archer RD, Gainesville, FL
            32610, USA
FEATURES             Location/Qualifiers
     source          1..2466
                     /organism="Mus musculus"
                     /strain="C57BL/6J-gm/gm"
                     /db_xref="taxon:10090"
                     /chromosome="14"
                     /map="between Mit63 and Mit122"
     gene            1..2466
                     /gene="Rabggta (with gunmetal mutation)"
     CDS             250..1953
                     /gene="Rabggta (with gunmetal mutation)"
                     /codon_start=1
                     /product="RABGGTA"
                     /translation="MHGRLKVKTSEEQAEAKRLEREQKLKLYQSATQAVFQKREAGEL
                     DESVLELTSQILGANPDFATLWNCRREVLQQLETQKSPEELAALVKAELGFLESCLRV
                     NPKSYGTWHHRCWLLSRLPEPNWARELELCARFLEADERNFHCWDYRRFVAAQAAVAP
                     AEELAFTDSLITRNFSNYSSWHYRSCLLPQLHPQPDSGPQGRLPENVLLRELELVQNA
                     FFTDPNDQSAWFYHRWLLGRAEPHDVLCCLHVSREEACLSVCFSRPLIVGSKMGTLLL
                     TVDEAPLSVEWRTPDGRNRPSHVWLCDLPAASLNDHLPQHTFRVIWTGSDTQKECVLL
                     KGHQECWCRDSATDEQLFRCELSVEKSTVLQSELESCKELQELEPENKWCLLTIILLM
                     RALDPLLYEKETLEYFSTLKAVDPMRAAYLDDLRSKFLVENSVLKMEYADVRVLHLAH
                     KDLTVLCHLEQLLLVTHLDLSHNRLRALPPALAALRCLEVLQASDNVLENLDGVANLP
                     RLRELLLCNNRLQQSAALQTLASCPRLVFLNLQGNSLCQEEGIRERLAEMLPSVSSIL
                     T"
BASE COUNT  510 a 702 c 695 g 559 t
ORIGIN

1 gtgcaagggt  ccacgggacg  gacggacggg  cgggcgggca  ctgctctcta  tatgcgcggc 

61  gtgtgcctcg  gccctgacgg  gtggggcttc  ggaaagatcc  gtggggacgg  aggccgtggt 

121  tgtggtgagc  gcttcctcct  gacctcacag  tgccagccgc  cggccccctt  ttaactcacg 

181  gggatcctta  tcctgggcct  gctggcagcc  tccggacccc  tcccctatcg  ccaacgctga 

241  gccgaaacca  tgcatgggcg  cctgaaggtg  aagacgtccg  aagagcaggc  agaggccaaa 

301  aggctagagc  gggaacagaa  gctaaagctc  taccagtcag  ccactcaagc  tgtcttccag 

361  aagcgggagg  caggcgagct  ggatgaatcc  gtcctagaac  tgacaagcca  gattctggga 


421 gccaaccctg attttgccac cctctggaac tgtcgcagag aagtgctcca gcagctagaa
481 acccagaagt cccctgagga gttggctgct cttgtgaagg cagaactagg cttccttgag
541 agctgtctgc gtgtgaaccc taagtcctat ggcacttggc accaccgctg ctggctgctg
601 agtcgcctgc ctgagcccaa ctgggcccgg gagctggagc tgtgcgctcg cttcctcgag
661 gccgatgagc ggaactttca ttgctgggac tatcggcgat ttgtagctgc acaggctgct
721 gtcgcgcctg cagaggaact agccttcact gacagcctca tcacccggaa cttctccaac
781 tattcttcct ggcattatcg ctcctgcctc ttgccccagc tgcaccccca gccagactct
841 ggcccccagg gacggctccc tgaaaatgta ctgctgagag agctggagtt ggtgcagaat
901 gccttcttta ctgaccccaa tgatcagagt gcctggttct atcaccgctg gctcctgggg
961 cgggcggagc cccacgatgt tctgtgctgc ctgcatgtga gccgggaaga ggcctgtctg
1021 tcagtctgct tttctcgtcc cctgatagtg ggctccaaga tggggacctt gctactcacg
1081 gttgacgagg cacctttgag tgtggaatgg aggactccag atggcaggaa ccggcccagc
1141 catgtctggc tctgtgacct gcctgctgcc tctctcaatg accacttgcc ccagcacacg
1201 tttcgtgtca tctggacagg aagcgatacc cagaaggagt gtgtgctttt aaaaggtcac
1261 caggagtgct ggtgccgaga ctcagccact gatgaacagt tgttcaggtg tgagctgtcc
1321 gtggagaagt ccacagtgct acagtccgag cttgagtcct gtaaggagct gcaggaactg
1381 gagcctgaga ataaatggtg cctgctgacc atcatcctcc tgatgagggc actggacccc
1441 ctcctctacg agaaagaaac actggagtac ttcagtaccc tcaaagctgt ggacccgatg
1501 agagcagcct acttggacga cctgcgcagc aagttcttgg tggaaaacag tgttctcaag
1561 atggagtatg ctgacgtgcg tgtgctgcac ctggctcaca aggatctcac agtgctctgc
1621 catttggaac aactgctctt ggtcactcat cttgacctgt cccataatcg tctccgagcc
1681 ttgcccccag ccctggctgc tctgcgctgt ctcgaggtgt tgcaggccag tgataatgtc
1741 ctggagaact tggacggcgt ggctaacctt ccccggctgc gggagctgtt actatgtaat
1801 aaccgcctcc agcagtctgc tgcacttcag actcttgcct cctgccccag gctagtcttc
1861 ctcaacctgc agggcaactc gctgtgccaa gaagagggca tccgggagcg tttggctgag
1921 atgctaccgt ccgtcagcag catcctcacc taggaggccc catctcaccc tcacccttta
1981 acttattggg actgaataaa gactggagag gcccctctag gctgccaagc taccattact
2041 gctgccctga caacacttcc tccattacca ccactgtgca ttttggaaaa aggaagggag
2101 gggctctctg gggtatgggg ccactgtcct gtgattctgt ctaccaagac cctcagtgtt
2161 tatggctggt caggaccagg gctgggtctg agttcaccat cagttttggg ctatcgtggt
2221 gcctgagctt gaagctggga taggaattag gatcaggttg tggtttggac tgctcatcag
2281 aagagctgtc tcggggctgg agagatggct cagcggttaa gagcgctgac tgttcttctg
2341 aaagtcctga gttcaaatcc cagcaaccac atggtggctc ataaccatct gtaataagat
2401 ctgacaccct cttctggtgt ctgaagacag ctacagtgta cttacatata ataataaata
2461 aatctt
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Comparisons  of  Rabggta  DNA  and  mRNA  from  gm/gm  and  B6  (control)  mice. 
This Appendix complements the RT-PCR fragments shown in Figure 4.4. The
two sets of 20 black italic nucleotides shown in this sequence represent the
locations of the primers used to amplify the fragments seen in Figure 4.4. All
sequence originated from the Rabggta gene. Sequence 1 corresponds to B6
DNA (contains all exons but only intron alpha, to save space).
Sequence 2 corresponds to gm/gm DNA (contains all exons but only intron
alpha, to save space). Sequence 3 corresponds to B6 mRNA (see
Figure 4.4b, band A). Sequence 4 corresponds to gm/gm mRNA (see Figure
4.4b, band C). Sequence 5 corresponds to gm/gm mRNA that has undergone
cryptic splicing (see Figure 4.4b, band D). The three dots (...) before a
bolded letter identify the exon/intron boundaries. The underlined “ATG”
represents the start codon. The underlined “TAG” represents the stop codon.
Genomic sequences 1 & 2 differ only (green “g” and red “a”) at the point
mutation shown in Figure 4.4c. The mutation occurs at the last nucleotide of
intron alpha’s splice acceptor. Coding sequences 3 & 4 differ at exon 1. B6
DNA (# 1) undergoes normal splicing, and thus retains exon 1 (green sequence
in # 3) along with its “ATG”. gm/gm DNA (# 2) does not undergo normal splicing
due to the point mutation, and thus skips/splices out exon 1 (red asterisks in # 4),
causing the loss of its “ATG”. Coding sequence 5 has utilized cryptic splice sites
70 bp from the 3’ end of intron alpha and 13 bp from the 5’ end of exon 1. This
region is identified by the blue sequence. The “ATG” is retained in this gm/gm
fragment. 
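The single-base difference between genomic sequences 1 and 2 can be located programmatically. The following is a minimal sketch under stated assumptions: the two strings are the intron alpha/exon 1 junction copied from rows 1 and 2 of the alignment below (lowercase for intron, uppercase for exon, following the alignment's own convention), and `find_substitutions` is a hypothetical helper, not a method used in this work.

```python
from typing import List, Tuple


def find_substitutions(ref: str, alt: str) -> List[Tuple[int, str, str]]:
    """Return (1-based position, ref base, alt base) for every mismatch."""
    if len(ref) != len(alt):
        raise ValueError("sequences must be the same length")
    return [
        (position, r, a)
        for position, (r, a) in enumerate(zip(ref, alt), start=1)
        if r.lower() != a.lower()
    ]


# End of intron alpha (lowercase) followed by the start of exon 1 (uppercase).
B6_JUNCTION = "gacatctctttcggcag" + "ATTGAACGGGCTGCC"
GM_JUNCTION = "gacatctctttcggcaa" + "ATTGAACGGGCTGCC"

differences = find_substitutions(B6_JUNCTION, GM_JUNCTION)

# Re-express each position relative to the first exonic base, so that
# -1 is the last nucleotide of the intron.
intron_length = sum(1 for base in B6_JUNCTION if base.islower())
relative = [(pos - intron_length - 1, r, a) for pos, r, a in differences]

print(differences)  # [(17, 'g', 'a')]
print(relative)     # [(-1, 'g', 'a')]
```

The only mismatch is a g to a change at position -1 relative to exon 1, which is the last nucleotide of the splice acceptor described above.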

1.  GTGCAAGGGTCCACGGGACGGACGGACGGGCGGGCGGGCACTGCTCTCTATATGCGCGGCGTGTGCCTC

2.  GTGCAAGGGTCCACGGGACGGACGGACGGGCGGGCGGGCACTGCTCTCTATATGCGCGGCGTGTGCCTC

3.  GTGCAAGGGTCCACGGGACGGACGGACGGGCGGGCGGGCACTGCTCTCTATATGCGCGGCGTGTGCCTC

4.  GTGCAAGGGTCCACGGGACGGACGGACGGGCGGGCGGGCACTGCTCTCTATATGCGCGGCGTGTGCCTC

5.  GTGCAAGGGTCCACGGGACGGACGGACGGGCGGGCGGGCACTGCTCTCTATATGCGCGGCGTGTGCCTC

1.  GGCCCTGACGGGTGGGGCTTCGGAAAGATCCGTGGGGACGGAGGCCGTGGTTGTGgtgagcgcttcctcctgacctcac

2.  GGCCCTGACGGGTGGGGCTTCGGAAAGATCCGTGGGGACGGAGGCCGTGGTTGTGgtgagcgcttcctcctgacctcac

3.  GGCCCTGACGGGTGGGGCTTCGGAAAGATCCGTGGGGACGGAGGCCGTGGTTGTG

4.  GGCCCTGACGGGTGGGGCTTCGGAAAGATCCGTGGGGACGGAGGCCGTGGTTGTG

5.  GGCCCTGACGGGTGGGGCTTCGGAAAGATCCGTGGGGACGGAGGCCGTGGTTGTGgtgagcgcttcctcctgacctcac

1.  agtgccagccgccggcccccttttaactcacggggatccttatcctgggcctgctggcagcttccgggaaacacctggggggcggggccgtgtgggcggggttctgtgatt 

2.  agtgccagccgccggcccccttttaactcacggggatccttatcctgggcctgctggcagcttccgggaaacacctggggggcggggccgtgtgggcggggttctgtgatt 

3  

4  

5.  agtgccagccgccggcccccttttaactcacggggatccttatcctgggcctgctggcag 

1.  gacatctctttcggcagATTGAACGGGCTGCCTCCGGACCCCTCCCCTATCGCCAACGCTGAGCCGAAACCATG...C

2.  gacatctctttcggcaaATTGAACGGGCTGCCTCCGGACCCCTCCCCTATCGCCAACGCTGAGCCGAAACCATG...C

3.                   ATTGAACGGGCTGCCTCCGGACCCCTCCCCTATCGCCAACGCTGAGCCGAAACCATG...C

4.                   *********************************************************...C

5.                                CCTCCGGACCCCTCCCCTATCGCCAACGCTGAGCCGAAACCATG...C
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1.  ATGGGCGCCTGAAGGTGAAGACGTCCGAAGAGCAGGCAGAGGCCAAAAGGCTAGAGCGGGAACAGAAG 

2.  ATGGGCGCCTGAAGGTGAAGACGTCCGAAGAGCAGGCAGAGGCCAAAAGGCTAGAGCGGGAACAGAAG 

3.  ATGGGCGCCTGAAGGTGAAGACGTCCGAAGAGCAGGCAGAGGCCAAAAGGCTAGAGCGGGAACAGAAG 

4.  ATGGGCGCCTGAAGGTGAAGACGTCCGAAGAGCAGGCAGAGGCCAAAAGGCTAGAGCGGGAACAGAAG 

5.  ATGGGCGCCTGAAGGTGAAGACGTCCGAAGAGCAGGCAGAGGCCAAAAGGCTAGAGCGGGAACAGAAG 

1-  CTAAAGCTCTACCAGTCAGCCACTCAAGCTGTCTTCCAGAAG...CGGGAGGCAGGCGAGCTGGATGAATC 

2-  CTAAAGCTCTACCAGTCAGCCACTCAAGCTGTCTTCCAGAAG . . . CGGGAGGCAGGCGAGCTGGATGAATC 

3.  CTAAAGCTCTACCAGTCAGCCACTCAAGCTGTCTTCCAGAAG . . . CGGGAGGCAGGCGAGCTGGATGAATC 

4.  CTAAAGCTCTACCAGTCAGCCACTCAAGCTGTCTTCCAGAAG . . . CGGGAGGCAGGCGAGCTGGATGAATC 

5.  CTAAAGCTCTACCAGTCAGCCACTCAAGCTGTCTTCCAGAAG . . . CGGGAGGCAGGCGAGCTGGATGAATC 

1.  CGTCCTAGAACTGACAAGCCAGATTCTGGGAGCCAACCCTGATTTTGCCACCCTCTGGAACTGTCGCAGA 

2.  CGTCCTAGAACTGACAAGCCAGATTCTGGGAGCCAACCCTGATTTTGCCACCCTCTGGAACTGTCGCAGA 

3.  CGTCCTAGAACTGACAAGCCAGATTCTGGGAGCCAACCCTGATTTTGCCACCCTCTGGAACTGTCGCAGA 

4.  CGTCCTAGAACTGACAAGCCAGATTCTGGGAGCCAACCCTGATTTTGCCACCCTCTGGAACTGTCGCAGA 

5.  CGTCCTAGAACTGACAAGCCAGATTCTGGGAGCCAACCCTGATTTTGCCACCCTCTGGAACTGTCGCAGA 

1 • GAAGTGCTCCAGCAGCTAGAAACCCAGAAGT. . . CCCCTGAGGAGTTGGCTGCTCTTGTGAAGGCAGAACT 
2-  GAAGTGCTCCAGCAGCTAGAAACCCAGAAGT. . . CCCCTGAGGAGTTGGCTGCTCTTGTGAAGGCAGAACT 

3.  GAAGTGCTCCAGCAGCTAGAAACCCAGAAGT. . . CCCCTGAGGAGTTGGCTGCTCTTGTGAAGGCAGAACT 

4.  GAAGTGCTCCAGCAGCTAGAAACCCAGAAGT. . . CCCCTGAGGAGTTGGCTGCTCTTGTGAAGGCAGAACT 

5.  GAAGTGCTCCAGCAGCTAGAAACCCAGAAGT. . . CCCCTGAGGAGTTGGCTGCTCTTGTGAAGGCAGAACT 

1.  AGGCTTCCTTGAGAGCTGTCTGCGTGTGAACCCTAAGTCCTATGGCACTTGGCACCACCGCTGCTGGCTGC

2.  AGGCTTCCTTGAGAGCTGTCTGCGTGTGAACCCTAAGTCCTATGGCACTTGGCACCACCGCTGCTGGCTGC

3.  AGGCTTCCTTGAGAGCTGTCTGCGTGTGAACCCTAAGTCCTATGGCACTTGGCACCACCGCTGCTGGCTGC

4.  AGGCTTCCTTGAGAGCTGTCTGCGTGTGAACCCTAAGTCCTATGGCACTTGGCACCACCGCTGCTGGCTGC

5.  AGGCTTCCTTGAGAGCTGTCTGCGTGTGAACCCTAAGTCCTATGGCACTTGGCACCACCGCTGCTGGCTGC

1-  TGAGTCGCCTGCCTGAGCCCAACTGGGCCCGGGAGCTGGAGCTGTGCGCTCGCTTCCTCGAGGCCGATGA 

2.  TGAGTCGCCTGCCTGAGCCCAACTGGGCCCGGGAGCTGGAGCTGTGCGCTCGCTTCCTCGAGGCCGATGA 

3.  TGAGTCGCCTGCCTGAGCCCAACTGGGCCCGGGAGCTGGAGCTGTGCGCTCGCTTCCTCGAGGCCGATGA 

4.  TGAGTCGCCTGCCTGAGCCCAACTGGGCCCGGGAGCTGGAGCTGTGCGCTCGCTTCCTCGAGGCCGATGA 

5.  TGAGTCGCCTGCCTGAGCCCAACTGGGCCCGGGAGCTGGAGCTGTGCGCTCGCTTCCTCGAGGCCGATGA 

1-  GCGGAACT...TTCATTGCTGGGACTATCGGCGATTTGTAGCTGCACAGGCTGCTGTCGCGCCTGCAGAGG 

2-  GCGGAACT. . . TTCATTGCTGGGACTATCGGCGATTTGTAGCTGCACAGGCTGCTGTCGCGCCTGCAGAGG 

3-  GCGGAACT.. .TTCATTGCTGGGACTATCGGCGATTTGTAGCTGCACAGGCTGCTGTCGCGCCTGCAGAGG 
4.  GCGGAACT. . . TTCATTGCTGGGACTATCGGCGATTTGTAGCTGCACAGGCTGCTGTCGCGCCTGCAGAGG 
5-  GCGGAACT. . . TTCATTGCTGGGACTATCGGCGATTTGTAGCTGCACAGGCTGCTGTCGCGCCTGCAGAGG 

1-  AACTAGCCTTCACTGACAGCCTCATCACCCGGAACTTCTCCAACTATTCTTCCTGGCATTATCGCTCCTGC 

2-  AACTAGCCTTCACTGACAGCCTCATCACCCGGAACTTCTCCAACTATTCTTCCTGGCATTATCGCTCCTGC 

3.  AACTAGCCTTCACTGACAGCCTCATCACCCGGAACTTCTCCAACTATTCTTCCTGGCATTATCGCTCCTGC 

4.  AACTAGCCTTCACTGACAGCCTCATCACCCGGAACTTCTCCAACTATTCTTCCTGGCATTATCGCTCCTGC 

5.  AACTAGCCTTCACTGACAGCCTCATCACCCGGAACTTCTCCAACTATTCTTCCTGGCATTATCGCTCCTGC 

1-  CTCTTGCCCCAGCTGCACCCCCAGCCAGACTCTGGCCCCCAGGGACGGCTCCCTGAAAATGTACTGCTGA 

2.  CTCTTGCCCCAGCTGCACCCCCAGCCAGACTCTGGCCCCCAGGGACGGCTCCCTGAAAATGTACTGCTGA 

3.  CTCTTGCCCCAGCTGCACCCCCAGCCAGACTCTGGCCCCCAGGGACGGCTCCCTGAAAATGTACTGCTGA 

4.  CTCTTGCCCCAGCTGCACCCCCAGCCAGACTCTGGCCCCCAGGGACGGCTCCCTGAAAATGTACTGCTGA 

5.  CTCTTGCCCCAGCTGCACCCCCAGCCAGACTCTGGCCCCCAGGGACGGCTCCCTGAAAATGTACTGCTGA 

1 • GAG . . . AGCTGGAGTTGGTGCAGAATGCCTTCTTTACTGACCCCAATGATCAGAGTGCCTGGTTCTATCACC 

2-  GAG . . . AGCTGGAGTTGGTGCAGAATGCCTTCTTTACTGACCCCAATGATCAGAGTGCCTGGTTCTATCACC 

3.  GAG . . . AGCTGGAGTTGGTGCAGAATGCCTTCTTTACTGACCCCAATGATCAGAGTGCCTGGTTCTATCACC 

4.  GAG . . . AGCTGGAGTTGGTGCAGAATGCCTTCTTTACTGACCCCAATGATCAGAGTGCCTGGTTCTATCACC 

5.  GAG...AGCTGGAGTTGGTGCAGAATGCCTTCTTTACTGACCCCAATGATCAGAGTGCCTGGTTCTATCACC 
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1.  GCTGGCTCCTGGGGCG...GGCGGAGCCCCACGATGTTCTGTGCTGCCTGCATGTGAGCCGGGAAGAGGCC 

2.  GCTGGCTCCTGGGGCG . . . GGCGGAGCCCCACGATGTTCTGTGCTGCCTGCATGTGAGCCGGGAAGAGGCC 

3.  GCTGGCTCCTGGGGCG . . . GGCGGAGCCCCACGATGTTCTGTGCTGCCTGCATGTGAGCCGGGAAGAGGCC 

4.  GCTGGCTCCTGGGGCG . . . GGCGGAGCCCCACGATGTTCTGTGCTGCCTGCATGTGAGCCGGGAAGAGGCC 

5.  GCTGGCTCCTGGGGCG . . . GGCGGAGCCCCACGATGTTCTGTGCTGCCTGCATGTGAGCCGGGAAGAGGCC 

1.  TGTCTGTCAGTCTGCTTTTCTCGTCCCCTGATAGTG...GGCTCCAAGATGGGGACCTTGCTACTCACGGTTG 

2.  TGTCTGTCAGTCTGCTTTTCTCGTCCCCTGATAGTG . . . GGCTCCAAGATGGGGACCTTGCTACTCACGGTTG 

3.  TGTCTGTCAGTCTGCTTTTCTCGTCCCCTGATAGTG . . . GGCTCCAAGATGGGGACCTTGCTACTCACGGTTG 

4.  TGTCTGTCAGTCTGCTTTTCTCGTCCCCTGATAGTG . . . GGCTCCAAGATGGGGACCTTGCTACTCACGGTTG 

5.  TGTCTGTCAGTCTGCTTTTCTCGTCCCCTGATAGTG . . . GGCTCCAAGATGGGGACCTTGCTACTCACGGTTG 

1 . ACGAGGCACCTTTGAGTGTGGAATGGAGGACTCCAGATGGCAGGAACCGGCCCAGCCATGTCTGG . . . CTC 

2.  ACGAGGCACCTTTGAGTGTGGAATGGAGGACTCCAGATGGCAGGAACCGGCCCAGCCATGTCTGG . . . CTC 

3.  ACGAGGCACCTTTGAGTGTGGAATGGAGGACTCCAGATGGCAGGAACCGGCCCAGCCATGTCTGG . . . CTC 

4.  ACGAGGCACCTTTGAGTGTGGAATGGAGGACTCCAGATGGCAGGAACCGGCCCAGCCATGTCTGG . . . CTC 

5.  ACGAGGCACCTTTGAGTGTGGAATGGAGGACTCCAGATGGCAGGAACCGGCCCAGCCATGTCTGG . . . CTC 

1-  TGTGACCTGCCTGCTGCCTCTCTCAATGACCACTTGCCCCAGCACACGTTTCGTGTCATCTGGACAGGAAG 

2.  TGTGACCTGCCTGCTGCCTCTCTCAATGACCACTTGCCCCAGCACACGTTTCGTGTCATCTGGACAGGAAG 

3.  TGTGACCTGCCTGCTGCCTCTCTCAATGACCACTTGCCCCAGCACACGTTTCGTGTCATCTGGACAGGAAG 

4.  TGTGACCTGCCTGCTGCCTCTCTCAATGACCACTTGCCCCAGCACACGTTTCGTGTCATCTGGACAGGAAG 

5.  TGTGACCTGCCTGCTGCCTCTCTCAATGACCACTTGCCCCAGCACACGTTTCGTGTCATCTGGACAGGAAG 

1 • CGATACCCAGAAGGAGTGTGTGCTTTTAAAAGG . . . TCACCAGGAGTGCTGGTGCCGAGACTCAGCCACTG 

2.  CGATACCCAGAAGGAGTGTGTGCTTTTAAAAGG . . . TCACCAGGAGTGCTGGTGCCGAGACTCAGCCACTG 

3.  CGATACCCAGAAGGAGTGTGTGCTTTTAAAAGG...TCACCAGGAGTGCTGGTGCCGAGACTCAGCCACTG 

4.  CGATACCCAGAAGGAGTGTGTGCTTTTAAAAGG...TCACCAGGAGTGCTGGTGCCGAGACTCAGCCACTG 

5.  CGATACCCAGAAGGAGTGTGTGCTTTTAAAAGG . . . TCACCAGGAGTGCTGGTGCCGAGACTCAGCCACTG 

1 • ATGAACAGTTGTTCAGGTG . . . TGAGCTGTCCGTGGAGAAGTCCACAGTGCTACAGTCCGAGCTTGAGTCC 

2-  ATGAACAGTTGTTCAGGTG . . . TGAGCTGTCCGTGGAGAAGTCCACAGTGCTACAGTCCGAGCTTGAGTCC 

3.  ATGAACAGTTGTTCAGGTG . . . TGAGCTGTCCGTGGAGAAGTCCACAGTGCTACAGTCCGAGCTTGAGTCC 

4.  ATGAACAGTTGTTCAGGTG . . . TGAGCTGTCCGTGGAGAAGTCCACAGTGCTACAGTCCGAGCTTGAGTCC 

5.  ATGAACAGTTGTTCAGGTG . . . TGAGCTGTCCGTGGAGAAGTCCACAGTGCTACAGTCCGAGCTTGAGTCC 

1.  TGTAAGGAGCTGCAGGAACTGGAGCCTGAGAATAAATG . . . GTGCCTGCTGACCATCATCCTCCTGATGAG

2.  TGTAAGGAGCTGCAGGAACTGGAGCCTGAGAATAAATG . . . GTGCCTGCTGACCATCATCCTCCTGATGAG

3.  TGTAAGGAGCTGCAGGAACTGGAGCCTGAGAATAAATG . . . GTGCCTGCTGACCATCATCCTCCTGATGAG 

4.  TGTAAGGAGCTGCAGGAACTGGAGCCTGAGAATAAATG . . . GTGCCTGCTGACCATCATCCTCCTGATGAG 

5.  TGTAAGGAGCTGCAGGAACTGGAGCCTGAGAATAAATG . . . GTGCCTGCTGACCATCATCCTCCTGATGAG 

1.  GGCACTGGACCCCCTCCTCTACGAGAAAGAAACACTGGAGTACTTCAGTACCCTCAAAGC . . . TGTGGACC

2.  GGCACTGGACCCCCTCCTCTACGAGAAAGAAACACTGGAGTACTTCAGTACCCTCAAAGC... TGTGGACC 

3.  GGCACTGGACCCCCTCCTCTACGAGAAAGAAACACTGGAGTACTTCAGTACCCTCAAAGC... TGTGGACC 

4.  GGCACTGGACCCCCTCCTCTACGAGAAAGAAACACTGGAGTACTTCAGTACCCTCAAAGC . . . TGTGGACC 

5.  GGCACTGGACCCCCTCCTCTACGAGAAAGAAACACTGGAGTACTTCAGTACCCTCAAAGC . . . TGTGGACC 

1.  CGATGAGAGCAGCCTACTTGGACGACCTGCGCAGCAAGTTCTTGGTGGAAAACAGTGTTCTCAAGATGG

2.  CGATGAGAGCAGCCTACTTGGACGACCTGCGCAGCAAGTTCTTGGTGGAAAACAGTGTTCTCAAGATGG

3.  CGATGAGAGCAGCCTACTTGGACGACCTGCGCAGCAAGTTCTTGGTGGAAAACAGTGTTCTCAAGATGG 

4.  CGATGAGAGCAGCCTACTTGGACGACCTGCGCAGCAAGTTCTTGGTGGAAAACAGTGTTCTCAAGATGG 

5.  CGATGAGAGCAGCCTACTTGGACGACCTGCGCAGCAAGTTCTTGGTGGAAAACAGTGTTCTCAAGATGG 

1.  AGTATGCTGACGTGCGTGTGCTGCACCTGGCTCACAAGG . . . ATCTCACAGTGCTCTGCCATTTGGAACAAC 

2.  AGTATGCTGACGTGCGTGTGCTGCACCTGGCTCACAAGG . . . ATCTCACAGTGCTCTGCCATTTGGAACAAC 

3.  AGTATGCTGACGTGCGTGTGCTGCACCTGGCTCACAAGG... ATCTCACAGTGCTCTGCCATTTGGAACAAC 

4.  AGTATGCTGACGTGCGTGTGCTGCACCTGGCTCACAAGG . . . ATCTCACAGTGCTCTGCCATTTGGAACAAC 

5.  AGTATGCTGACGTGCGTGTGCTGCACCTGGCTCACAAGG . . . ATCTCACAGTGCTCTGCCATTTGGAACAAC 

1.  TGCTCTTGGTCACTCATCTTGACCTGTCCCATAATCGTCTCCGAGCCTTGCCCCCAGCCCTGGCTGCTCTG

2.  TGCTCTTGGTCACTCATCTTGACCTGTCCCATAATCGTCTCCGAGCCTTGCCCCCAGCCCTGGCTGCTCTG 

3.  TGCTCTTGGTCACTCATCTTGACCTGTCCCATAATCGTCTCCGAGCCTTGCCCCCAGCCCTGGCTGCTCTG 

4.  TGCTCTTGGTCACTCATCTTGACCTGTCCCATAATCGTCTCCGAGCCTTGCCCCCAGCCCTGGCTGCTCTG 

5.  TGCTCTTGGTCACTCATCTTGACCTGTCCCATAATCGTCTCCGAGCCTTGCCCCCAGCCCTGGCTGCTCTG 

1.  CGCTGTCTCGAGGT...GTTGCAGGCCAGTGATAATGTCCTGGAGAACTTGGACGGCGTGGCTAACCTTCCC 

2.  CGCTGTCTCGAGGT. . . GTTGCAGGCCAGTGATAATGTCCTGGAGAACTTGGACGGCGTGGCTAACCTTCCC 

3.  CGCTGTCTCGAGGT.. .GTTGCAGGCCAGTGATAATGTCCTGGAGAACTTGGACGGCGTGGCTAACCTTCCC 

4.  CGCTGTCTCGAGGT. . . GTTGCAGGCCAGTGATAATGTCCTGGAGAACTTGGACGGCGTGGCTAACCTTCCC 

5.  CGCTGTCTCGAGGT. . . GTTGCAGGCCAGTGATAATGTCCTGGAGAACTTGGACGGCGTGGCTAACCTTCCC 

1.  CGGCTGCGGGAGCTGTTACTATGTAATAACC. . . GCCTCCAGCAGTCTGCTGCACTTCAGACTCTTGCCTCC 

2.  CGGCTGCGGGAGCTGTTACTATGTAATAACC...GCCTCCAGCAGTCTGCTGCACTTCAGACTCTTGCCTCC 

3.  CGGCTGCGGGAGCTGTTACTATGTAATAACC...GCCTCCAGCAGTCTGCTGCACTTCAGACTCTTGCCTCC 

4.  CGGCTGCGGGAGCTGTTACTATGTAATAACC . . . GCCTCCAGCAGTCTGCTGCACTTCAGACTCTTGCCTCC

5.  CGGCTGCGGGAGCTGTTACTATGTAATAACC...GCCTCCAGCAGTCTGCTGCACTTCAGACTCTTGCCTCC 

1.  TGCCCCAGGCTAGTCTTCCTCAACCTGCAGGGCAACTCGCTGTGCCAAGAAGAGGGCATCCGGGAGCGTT

2.  TGCCCCAGGCTAGTCTTCCTCAACCTGCAGGGCAACTCGCTGTGCCAAGAAGAGGGCATCCGGGAGCGTT

3.  TGCCCCAGGCTAGTCTTCCTCAACCTGCAGGGCAACTCGCTGTGCCAAGAAGAGGGCATCCGGGAGCGTT

4.  TGCCCCAGGCTAGTCTTCCTCAACCTGCAGGGCAACTCGCTGTGCCAAGAAGAGGGCATCCGGGAGCGTT

5.  TGCCCCAGGCTAGTCTTCCTCAACCTGCAGGGCAACTCGCTGTGCCAAGAAGAGGGCATCCGGGAGCGTT

1.  tggctgagatgctaccgtccgtcagcagcatcctcacctaggaggccccatctcaccctcaccctttaac 

2.  tggctgagatgctaccgtccgtcagcagcatcctcacctaggaggccccatctcaccctcaccctttaac 

3.  tggctgagatgctaccgtccgtcagcagcatcctcacctaggaggccccatctcaccctcaccctttaac 

4.  tggctgagatgctaccgtccgtcagcagcatcctcacctaggaggccccatctcaccctcaccctttaac

5.  tggctgagatgctaccgtccgtcagcagcatcctcacctaggaggccccatctcaccctcaccctttaac

1.  TTATTGGGACTGAATAAAGACTGGAGAGGCCCCTCTAGGCTGCCAAGCTACCATTACTGCTGCCCTGACA

2.  TTATTGGGACTGAATAAAGACTGGAGAGGCCCCTCTAGGCTGCCAAGCTACCATTACTGCTGCCCTGACA

3.  TTATTGGGACTGAATAAAGACTGGAGAGGCCCCTCTAGGCTGCCAAGCTACCATTACTGCTGCCCTGACA

4.  TTATTGGGACTGAATAAAGACTGGAGAGGCCCCTCTAGGCTGCCAAGCTACCATTACTGCTGCCCTGACA

5.  TTATTGGGACTGAATAAAGACTGGAGAGGCCCCTCTAGGCTGCCAAGCTACCATTACTGCTGCCCTGACA

1.  ACACTTCCTCCATTACCACCACTGTGCATTTTGGAAAAAGGAAGGGAGGGGCTCTCTGGGGTATGGGGCC

2.  ACACTTCCTCCATTACCACCACTGTGCATTTTGGAAAAAGGAAGGGAGGGGCTCTCTGGGGTATGGGGCC

3.  ACACTTCCTCCATTACCACCACTGTGCATTTTGGAAAAAGGAAGGGAGGGGCTCTCTGGGGTATGGGGCC 

4.  ACACTTCCTCCATTACCACCACTGTGCATTTTGGAAAAAGGAAGGGAGGGGCTCTCTGGGGTATGGGGCC 

5.  ACACTTCCTCCATTACCACCACTGTGCATTTTGGAAAAAGGAAGGGAGGGGCTCTCTGGGGTATGGGGCC 

1.  ACTGTCCTGTGATTCTGTCTACCAAGACCCTCAGTGTTTATGGCTGGTCAGGACCAGGGCTGGGTCTGAGT 

2.  ACTGTCCTGTGATTCTGTCTACCAAGACCCTCAGTGTTTATGGCTGGTCAGGACCAGGGCTGGGTCTGAGT 

3.  ACTGTCCTGTGATTCTGTCTACCAAGACCCTCAGTGTTTATGGCTGGTCAGGACCAGGGCTGGGTCTGAGT 

4.  ACTGTCCTGTGATTCTGTCTACCAAGACCCTCAGTGTTTATGGCTGGTCAGGACCAGGGCTGGGTCTGAGT 

5.  ACTGTCCTGTGATTCTGTCTACCAAGACCCTCAGTGTTTATGGCTGGTCAGGACCAGGGCTGGGTCTGAGT 

1.  TCACCATCAGTTTTGGGCTATCGTGGTGCCTGAGCTTGAAGCTGGGATAGGAATTAGGATCAGGTTGTGG 

2.  TCACCATCAGTTTTGGGCTATCGTGGTGCCTGAGCTTGAAGCTGGGATAGGAATTAGGATCAGGTTGTGG 

3.  TCACCATCAGTTTTGGGCTATCGTGGTGCCTGAGCTTGAAGCTGGGATAGGAATTAGGATCAGGTTGTGG

4.  TCACCATCAGTTTTGGGCTATCGTGGTGCCTGAGCTTGAAGCTGGGATAGGAATTAGGATCAGGTTGTGG 

5.  TCACCATCAGTTTTGGGCTATCGTGGTGCCTGAGCTTGAAGCTGGGATAGGAATTAGGATCAGGTTGTGG 

1.  TTTGGACTGCTCATCAGAAGAGCTGTCTCGGGGCTGGAGAGATGGCTCAGCGGTTAAGAGCGCTGACTGT

2.  TTTGGACTGCTCATCAGAAGAGCTGTCTCGGGGCTGGAGAGATGGCTCAGCGGTTAAGAGCGCTGACTGT 

3.  TTTGGACTGCTCATCAGAAGAGCTGTCTCGGGGCTGGAGAGATGGCTCAGCGGTTAAGAGCGCTGACTGT 

4.  tttggactgctcatcagaagagctgtctcggggctggagagatggctcagcggttaagagcgctgactgt

5.  TTTGGACTGCTCATCAGAAGAGCTGTCTCGGGGCTGGAGAGATGGCTCAGCGGTTAAGAGCGCTGACTGT

1.  TCTTCTGAAAGTCCTGAGTTCAAATCCCAGCAACCACATGGTGGCTCATAACCATCTGTAATAAGATCTG

2.  TCTTCTGAAAGTCCTGAGTTCAAATCCCAGCAACCACATGGTGGCTCATAACCATCTGTAATAAGATCTG 

3.  TCTTCTGAAAGTCCTGAGTTCAAATCCCAGCAACCACATGGTGGCTCATAACCATCTGTAATAAGATCTG 

4.  TCTTCTGAAAGTCCTGAGTTCAAATCCCAGCAACCACATGGTGGCTCATAACCATCTGTAATAAGATCTG 

5.  TCTTCTGAAAGTCCTGAGTTCAAATCCCAGCAACCACATGGTGGCTCATAACCATCTGTAATAAGATCTG 

1.  ACACCCTCTTCTGGTGTCTGAAGACAGCTACAGTGTACTTACATATAATAATAAATAAATCTT 

2.  ACACCCTCTTCTGGTGTCTGAAGACAGCTACAGTGTACTTACATATAATAATAAATAAATCTT 

3.  ACACCCTCTTCTGGTGTCTGAAGACAGCTACAGTGTACTTACATATAATAATAAATAAATCTT 

4.  ACACCCTCTTCTGGTGTCTGAAGACAGCTACAGTGTACTTACATATAATAATAAATAAATCTT 

5.  ACACCCTCTTCTGGTGTCTGAAGACAGCTACAGTGTACTTACATATAATAATAAATAAATCTT 


APPENDIX  B 

INTER-SPECIES  RABGGTA  SEQUENCE  AND  AMINO  ACID  COMPARISONS 


Inter-species Rabggta coding sequence comparisons. a, Mouse (Query) vs. Rat
(Subject) sequence alignment. Over the 1704 coding nucleotides, mouse and
rat match 94%. b, Mouse (Query) vs. Human (Subject) sequence alignment.
Over the 1704 coding nucleotides, mouse and human match 86%.
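
The percentages quoted in this caption can be recomputed either from the BLAST "Identities" counts or directly from any aligned pair of lines. The following is a minimal Python sketch and not part of the original analysis: the function name `percent_identity` is hypothetical, and the two strings are the first 60-nucleotide block of the mouse (Query) and rat (Sbjct) alignment in part (a).

```python
def percent_identity(query: str, subject: str) -> tuple[int, int, float]:
    """Return (matches, columns, percent) for two equal-length aligned strings."""
    if len(query) != len(subject):
        raise ValueError("aligned sequences must have equal length")
    # A column counts as a match only if both residues agree and neither is a gap.
    matches = sum(
        q == s and q != "-"
        for q, s in zip(query.lower(), subject.lower())
    )
    return matches, len(query), 100.0 * matches / len(query)


# First alignment block of Appendix B(a): mouse 1-60 vs. rat 51-110.
mouse = "atgcatgggcgcctgaaggtgaagacgtccgaagagcaggcagaggccaaaaggctagag"
rat = "atgcatgggcgcctgaaggtgaagacgtcggaagagcaggcagaggccaaaaggctagaa"

matches, columns, pct = percent_identity(mouse, rat)
print(f"block 1: {matches}/{columns} ({pct:.1f}%)")

# Whole-alignment values from the reported BLAST counts.
for label, ident, total in [("mouse/rat", 1606, 1704), ("mouse/human", 1467, 1703)]:
    print(f"{label}: {ident}/{total} ({100.0 * ident / total:.1f}%)")
```

Applied to the full set of Query/Sbjct lines, the same function would be expected to reproduce the reported counts, provided the lines are concatenated without their coordinate columns.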


(a)  Mouse  vs.  Rat  Sequence 


gb|L10415|RATRABGERA Rat rab geranylgeranyl transferase alpha-subunit mRNA,
complete cds.

Length  = 2660 

Score  = 2601  bits  (1312),  Expect  = 0.0 
Identities  = 1606/1704  (94%) 

Strand  = Plus  / Plus 


Query: 1    atgcatgggcgcctgaaggtgaagacgtccgaagagcaggcagaggccaaaaggctagag  60
Sbjct: 51   atgcatgggcgcctgaaggtgaagacgtcggaagagcaggcagaggccaaaaggctagaa  110

Query: 61   cgggaacagaagctaaagctctaccagtcagccactcaagctgtcttccagaagcgggag  120
Sbjct: 111  cgagagcagaagctaaagctctaccagtcagccactcaagctgtcttccagaagcgtcag  170

Query: 121  gcaggcgagctggatgaatccgtcctagaactgacaagccagattctgggagccaaccct  180
Sbjct: 171  gctggtgagctggatgaatcagtgctagaactgacaagccagattctgggagccaaccct  230

Query: 181  gattttgccaccctctggaactgtcgcagagaagtgctccagcagctagaaacccagaag  240
Sbjct: 231  gattttgccaccctctggaactgtcgaagagaagtactccagcacctggaaaccgagaag  290

Query: 241  tcccctgaggagttggctgctcttgtgaaggcagaactaggcttccttgagagctgtctg  300
Sbjct: 291  tcccctgaggagtcagctgctcttgtgaaggcagaactgggcttcctagagagctgtctt  350

Query: 301  cgtgtgaaccctaagtcctatggcacttggcaccaccgctgctggctgctgagtcgcctg  360
Sbjct: 351  cgtgtgaaccctaagtcctatggcacttggcaccaccgctgctggctgctgagtcgcctg  410

Query: 361  cctgagcccaactgggcccgggagctggagctgtgcgctcgcttcctcgaggccgatgag  420
Sbjct: 411  cctgagcccaactgggcccgggagctggagctatgcgctcgcttcctcgaggccgatgag  470

Query: 421  cggaactttcattgctgggactatcggcgatttgtagctgcacaggctgctgtcgcgcct  480
Sbjct: 471  cggaactttcattgctgggactatcggcgatttgtagccgcacaggctgctgttgcccct  530

Query: 481  gcagaggaactagccttcactgacagcctcatcacccggaacttctccaactattcttcc  540
Sbjct: 531  gcagaggaactagccttcactgacagcctcatcacccgtaacttctccaactattcctcc  590

Query: 541  tggcattatcgctcctgcctcttgccccagctgcacccccagccagactctggcccccag  600
Sbjct: 591  tggcattatcgctcctgcctcttgccacagctgcacccccagccagactctggcccacag  650

Query: 601  ggacggctccctgaaaatgtactgctgagagagctggagttggtgcagaatgccttcttt  660
Sbjct: 651  ggtcggctccctgaaaatgtactgctgaaagagctggagttggtgcagaatgccttcttt  710

Query: 661  actgaccccaatgatcagagtgcctggttctatcaccgctggctcctggggcgggcggag  720
Sbjct: 711  actgaccccaacgatcagagtgcctggttctatcaccgctggctcctgggtcgggcggag  770

Query: 721  ccccacgatgttctgtgctgcctgcatgtgagccgggaagaggcctgtctgtcagtctgc  780
Sbjct: 771  ccccacgatgttctatgctgtgtgcatgtgagccgggaggaggcctgtctgtcggtctgc  830

Query: 781  ttttctcgtcccctgatagtgggctccaagatggggaccttgctactcacggttgacgag  840
Sbjct: 831  ttttctcgtcccctgacagtgggctccagaatggggaccttgctactcatggttgatgag  890

Query: 841  gcacctttgagtgtggaatggaggactccagatggcaggaaccggcccagccatgtctgg  900
Sbjct: 891  gcacctttgagtgtggagtggaggactccagatggcaggaaccggcccagccatgtctgg  950

Query: 901  ctctgtgacctgcctgctgcctctctcaatgaccacttgccccagcacacgtttcgtgtc  960
Sbjct: 951  ctctgtgacctgcctgctgcctctctcaatgaccagttgccccagcacacgtttcgtgtc  1010

Query: 961  atctggacaggaagcgatacccagaaggagtgtgtgcttttaaaaggtcaccaggagtgc  1020
Sbjct: 1011 atctggacaggaagtgatagccagaaggagtgtgtgcttttaaaagaccgcccagaatgc  1070

Query: 1021 tggtgccgagactcagccactgatgaacagttgttcaggtgtgagctgtccgtggagaag  1080
Sbjct: 1071 tggtgccgagactcagccactgatgaacagctattcaggtgtgagctgtcagtggaaaag  1130

Query: 1081 tccacagtgctacagtccgagcttgagtcctgtaaggagctgcaggaactggagcctgag  1140
Sbjct: 1131 tccacagtgctgcagtctgagcttgaatcctgtaaggagctgcaggaactagagcctgag  1190

Query: 1141 aataaatggtgcctgctgaccatcatcctcctgatgagggcactggaccccctcctctac  1200
Sbjct: 1191 aataaatggtgcctgctgaccatcatcctgctgatgcgagcactggaccccctcctctat  1250

Query: 1201 gagaaagaaacactggagtacttcagtaccctcaaagctgtggacccgatgagagcagcc  1260
Sbjct: 1251 gagaaggaaacactgcagtacttcagtaccctcaaagctgtggacccaatgagagcagct  1310

Query: 1261 tacttggacgacctgcgcagcaagttcttggtggaaaacagtgttctcaagatggagtat  1320
Sbjct: 1311 tacttggatgacctgcgcagcaagttcttgctggagaacagtgttctcaagatggagtac  1370

Query: 1321 gctgacgtgcgtgtgctgcacctggctcacaaggatctcacagtgctctgccatttggaa  1380
Sbjct: 1371 gctgatgtacgcgtgctgcacctggctcacaaggatctcacagtgctctgccacctggaa  1430

Query: 1381 caactgctcttggtcactcatcttgacctgtcccataatcgtctccgagccttgccccca  1440
Sbjct: 1431 caactgctcttggtcactcatcttgacctgtcccataatcgactccgagccttaccccca  1490

Query: 1441 gccctggctgctctgcgctgtctcgaggtgttgcaggccagtgataatgtcctggagaac  1500
Sbjct: 1491 gccctggctgctctgcgctgtcttgaggtgctgcaggccagtgacaatgccctggagaac  1550

Query: 1501 ttggacggcgtggctaaccttccccggctgcgggagctgttactatgtaataaccgcctc  1560
Sbjct: 1551 gtggatggtgtggctaaccttccccggctacaagagctgttactatgtaataaccgcctc  1610

Query: 1561 cagcagtctgctgcacttcagactcttgcctcctgccccaggctagtcttcctcaacctg  1620
Sbjct: 1611 caacagtctgcagcaatccagcctcttgtctcctgccccaggctagtcctcctcaacctg  1670

Query: 1621 cagggcaactcgctgtgccaagaagagggcatccgggagcgtttggctgagatgctaccg  1680
Sbjct: 1671 cagggcaactcgctgtgccaagaagagggcatccaggagcgtttggctgagatgctaccg  1730

Query: 1681 tccgtcagcagcatcctcacctag  1704
Sbjct: 1731 tctgtcagcagcatcctcacctag  1754

(b) Mouse vs. Human Sequence


ref|NM_004581.1|RABGGTA| Homo sapiens Rab geranylgeranyltransferase, alpha
subunit (RABGGTA) mRNA
>gi|2950169|emb|Y08200|HSRABGTRA Homo sapiens mRNA
for rab geranylgeranyl transferase, alpha-subunit
Length = 2067

Score  = 1505  bits  (759),  Expect  = 0.0 
Identities  = 1467/1703  (86%) 

Strand  = Plus  / Plus 


Query: 1    atgcatgggcgcctgaaggtgaagacgtccgaagagcaggcagaggccaaaaggctagag  60
Sbjct: 275  atgcacggacgcctgaaggtgaagacgtcagaagagcaggcggaggccaaaaggctagag  334

Query: 61   cgggaacagaagctaaagctctaccagtcagccactcaagctgtcttccagaagcgggag  120
Sbjct: 335  cgagagcagaagctgaagctataccagtcagccacccaggccgtattccagaagcgccag  394

Query: 121  gcaggcgagctggatgaatccgtcctagaactgacaagccagattctgggagccaaccct  180
Sbjct: 395  gctggtgagctggatgagtccgtgctggaactgacaagccagattctgggagccaaccct  454

Query: 181  gattttgccaccctctggaactgtcgcagagaagtgctccagcagctagaaacccagaag  240
Sbjct: 455  gattttgccaccctctggaactgccgacgagaggtgctccagcagctggagactcagaag  514

Query: 241  tcccctgaggagttggctgctcttgtgaaggcagaactaggcttccttgagagctgtctg  300
Sbjct: 515  tctcctgaagagttggctgctctggtgaaggcagaactgggcttcctggagagctgcctg  574

Query: 301  cgtgtgaaccctaagtcctatggcacttggcaccaccgctgctggctgctgagtcgcctg  360
Sbjct: 575  cgggtgaaccccaagtcttatggtacctggcaccaccgatgctggctgctaggccgcctg  634

Query: 361  cctgagcccaactgggcccgggagctggagctgtgcgctcgcttcctcgaggccgatgag  420
Sbjct: 635  cctgagcccaactggacccgagagctggagctctgtgcccgtttcctggaggtggatgag  694

Query: 421  cggaactttcattgctgggactatcggcgatttgtagctgcacaggctgctgtcgcgcct  480
Sbjct: 695  cggaactttcactgctgggactatcggcggtttgtggccacacaggcagccgtgccccct  754

Query: 481  gcagaggaactagccttcactgacagcctcatcacccggaacttctccaactattcttcc  540
Sbjct: 755  gcagaagagctagccttcactgacagcctcatcacccgaaacttctccaactactcttcc  814

Query: 541  tggcattatcgctcctgcctcttgccccagctgcacccccagccagactctggcccccag  600
Sbjct: 815  tggcattaccgctcctgtctcttgccccagttgcacccccagccggattctggaccacag  874




Query: 601  ggacggctccctgaaaatgtactgctgagagagctggagttggtgcagaatgccttcttt  660
Sbjct: 875  gggcgcctccctgaggatgtgctgctcaaagagctggagctggtgcagaatgccttcttc  934

Query: 661  actgaccccaatgatcagagtgcctggttctatcaccgctggctcctggggcgggcggag  720
Sbjct: 935  actgaccccaatgaccagagtgcctggttttatcaccggtggctcctaggtcgagctgac  994

Query: 721  ccccacgatgttctgtgctgcctgcatgtgagccgggaagaggcctgtctgtcagtctgc  780
Sbjct: 995  ccccaggatgcactgcgctgtctgcatgtgagccgggacgaggcctgtctgactgtctcc  1054

Query: 781  ttttctcgtcccctgatagtgggctccaagatggggaccttgctactcacggttgacgag  840
Sbjct: 1055 ttctctcggcccctcttagtgggctccaggatggagatcttgctgctcatggttgatgat  1114

Query: 841  gcacctttgagtgtggaatggaggactccagatggcaggaaccggcccagccatgtctgg  900
Sbjct: 1115 tctcccctgattgtggagtggaggaccccagatggcaggaaccggcccagccatgtctgg  1174

Query: 901  ctctgtgacctgcctgctgcctctctcaatgaccacttgccccagcacacgtttcgtgtc  960
Sbjct: 1175 ctctgtgacctgcctgctgcctccctcaacgaccagttgccccaacatacatttcgcgtc  1234

Query: 961  atctggacaggaagcgatacccagaaggagtgtgtgcttttaaaaggtcaccaggagtgc  1020
Sbjct: 1235 atttggacagcaggcgatgtccagaaagaatgcgtgcttttaaaaggccgccaggagggc  1294

Query: 1021 tggtgccgagactcagccactgatgaacagttgttcaggtgtgagctgtccgtggagaag  1080
Sbjct: 1295 tggtgccgggactccacgacagacgagcagctattcaggtgtgagctgtcagtggagaag  1354

Query: 1081 tccacagtgctacagtccgagcttgagtcctgtaaggagctgcaggaactggagcctgag  1140
Sbjct: 1355 tccacagtgctgcagtctgagctggaatcctgtaaggagctgcaggagctggagcctgag  1414

Query: 1141 aataaatggtgcctgctgaccatcatcctcctgatgagggcactggaccccctcctctac  1200
Sbjct: 1415 aataaatggtgcctgcttaccatcatcctgctgatgcgggcactggaccccctgctgtat  1474

Query: 1201 gagaaagaaacactggagtacttcagtaccctcaaagctgtggacccgatgagagcagcc  1260
Sbjct: 1475 gagaaggagaccctgcagtacttccagaccctcaaggccgtggaccccatgcgggcaacg  1534

Query: 1261 tacttggacgacctgcgcagcaagttcttggtggaaaacagtgttctcaagatggagtat  1320
Sbjct: 1535 tatctggatgacctgcgcagcaagttcttgctggagaatagcgtgctcaagatggagtat  1594

Query: 1321 gctgacgtgcgtgtgctgcacctggctcacaaggatctcacagtgctctgccatttggaa  1380
Sbjct: 1595 gccgaggtgcgtgtgctgcacctggctcacaaggatctgacagtgctctgccatctggaa  1654

Query: 1381 caactgctcttggtcactcatcttgacctgtcccataatcgtctccgagccttgccccca  1440
Sbjct: 1655 cagctgctcttggtcacccatcttgacttgtcacacaatcgcctccgaaccctgccacct  1714

Query: 1441 gccctggctgctctgcgctgtctcgaggtgttgcaggccagtgataatgtcctggagaac  1500
Sbjct: 1715 gcactggctgccctgcgctgccttgaggtgctgcaggccagtgataatgcaatagagtcc  1774

Query: 1501 ttggacggcgtggctaaccttccccggctgcgggagctgttactatgtaataaccgcctc  1560
Sbjct: 1775 ctggacggcgtcaccaacctaccccggctgcaggagctgctactgtgcaacaaccgcctc  1834

Query: 1561 cagcagtctgctgcacttcagactcttgcctcctgccccaggctagtcttcctcaacctg  1620
Sbjct: 1835 cagcagcctgcagtgctccagcctcttgcctcctgccccaggctggtcctcctcaacctg  1894

Query: 1621 cagggcaactcgctgtgccaagaagagggcatccgggagcgtttggctgagatgctaccg  1680
Sbjct: 1895 cagggtaacccgctgtgccaagcggtgggcatcttggagcaactggctgaactgctgcct  1954

Query: 1681 tccgtcagcagcatcctcaccta  1703
Sbjct: 1955 tcagttagcagcgtcctcaccta  1977

Inter-species Rabggta protein comparisons. a, Mouse (top line) vs. Rat (bottom
line) protein alignment. Over the 567 amino acids, mouse and rat match 95.8%.
No large regional differences are observed. b, Mouse (top line) vs. Human
(bottom line) protein alignment. Over the 567 amino acids, mouse and human
match 90.8%. No large regional differences are observed. c, Mouse (top line)
vs. Yeast BET4 (bottom line) protein alignment. Over 290 amino acids, mouse
and yeast match 33%.
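
The protein alignments are comparisons of the translated coding sequences. As a consistency check between the nucleotide and protein sections, the sketch below translates the first 60 and last 24 coding nucleotides of the mouse Query sequence with the standard genetic code. It is an illustrative Python example, not part of the original work; the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore an incomplete trailing codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# Mouse Rabggta coding nucleotides 1-60 and 1681-1704 (Query lines of Appendix B).
mouse_start = "atgcatgggcgcctgaaggtgaagacgtccgaagagcaggcagaggccaaaaggctagag"
mouse_end = "tccgtcagcagcatcctcacctag"

print(translate(mouse_start))  # MHGRLKVKTSEEQAEAKRLE
print(translate(mouse_end))    # SVSSILT*
```

Both results agree with the mouse protein Query lines: residues 1 to 20 of the first alignment block and the C-terminal SVSSILT at residues 561 to 567.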


(a)  Mouse  vs.  Rat  Amino  Acid 


sp|Q08602|PGTA_RAT RAB GERANYLGERANYLTRANSFERASE ALPHA SUBUNIT (RAB
GERANYL-GERANYLTRANSFERASE ALPHA SUBUNIT) (RAB GG
TRANSFERASE) (RAB GGTASE)

Length  = 567 

Score  = 1033  bits  (2643),  Expect  = 0.0 

Identities  = 543/567  (96%),  Positives  = 554/567  (98%) 


Query: 1    MHGRLKVKTSEEQAEAKRLEREQKLKLYQSATQAVFQKREAGELDESVLELTSQILGANP  60
            MHGRLKVKTSEEQAEAKRLEREQKLKLYQSATQAVFQKR+AGELDESVLELTSQILGANP
Sbjct: 1    MHGRLKVKTSEEQAEAKRLEREQKLKLYQSATQAVFQKRQAGELDESVLELTSQILGANP  60

Query: 61   DFATLWNCRREVLQQLETQKSPEELAALVKAELGFLESCLRVNPKSYGTWHHRCWLLSRL  120
            DFATLWNCRREVLQ LET+KSPEE AALVKAELGFLESCLRVNPKSYGTWHHRCWLLSRL
Sbjct: 61   DFATLWNCRREVLQHLETEKSPEESAALVKAELGFLESCLRVNPKSYGTWHHRCWLLSRL  120

Query: 121  PEPNWARELELCARFLEADERNFHCWDYRRFVAAQAAVAPAEELAFTDSLITRNFSNYSS  180
            PEPNWARELELCARFLEADERNFHCWDYRRFVAAQAAVAPAEELAFTDSLITRNFSNYSS
Sbjct: 121  PEPNWARELELCARFLEADERNFHCWDYRRFVAAQAAVAPAEELAFTDSLITRNFSNYSS  180

Query: 181  WHYRSCLLPQLHPQPDSGPQGRLPENVLLRELELVQNAFFTDPNDQSAWFYHRWLLGRAE  240
            WHYRSCLLPQLHPQPDSGPQGRLPENVLL+ELELVQNAFFTDPNDQSAWFYHRWLLGRAE
Sbjct: 181  WHYRSCLLPQLHPQPDSGPQGRLPENVLLKELELVQNAFFTDPNDQSAWFYHRWLLGRAE  240

Query: 241  PHDVLCCLHVSREEACLSVCFSRPLIVGSKMGTLLLTVDEAPLSVEWRTPDGRNRPSHVW  300
            PHDVLCC+HVSREEACLSVCFSRPL VGS+MGTLLL VDEAPLSVEWRTPDGRNRPSHVW
Sbjct: 241  PHDVLCCVHVSREEACLSVCFSRPLTVGSRMGTLLLMVDEAPLSVEWRTPDGRNRPSHVW  300

Query: 301  LCDLPAASLNDHLPQHTFRVIWTGSDTQKECVLLKGHQECWCRDSATDEQLFRCELSVEK  360
            LCDLPAASLND LPQHTFRVIWTGSD+QKECVLLK   ECWCRDSATDEQLFRCELSVEK
Sbjct: 301  LCDLPAASLNDQLPQHTFRVIWTGSDSQKECVLLKDRPECWCRDSATDEQLFRCELSVEK  360

Query: 361  STVLQSELESCKELQELEPENKWCLLTIILLMRALDPLLYEKETLEYFSTLKAVDPMRAA  420
            STVLQSELESCKELQELEPENKWCLLTIILLMRALDPLLYEKETL+YFSTLKAVDPMRAA
Sbjct: 361  STVLQSELESCKELQELEPENKWCLLTIILLMRALDPLLYEKETLQYFSTLKAVDPMRAA  420

Query: 421  YLDDLRSKFLVENSVLKMEYADVRVLHLAHKDLTVLCHLEQLLLVTHLDLSHNRLRALPP  480
            YLDDLRSKFL+ENSVLKMEYADVRVLHLAHKDLTVLCHLEQLLLVTHLDLSHNRLRALPP
Sbjct: 421  YLDDLRSKFLLENSVLKMEYADVRVLHLAHKDLTVLCHLEQLLLVTHLDLSHNRLRALPP  480

Query: 481  ALAALRCLEVLQASDNVLENLDGVANLPRLRELLLCNNRLQQSAALQTLASCPRLVFLNL  540
            ALAALRCLEVLQASDN LEN+DGVANLPRL+ELLLCNNRLQQSAA+Q L SCPRLV LNL
Sbjct: 481  ALAALRCLEVLQASDNALENVDGVANLPRLQELLLCNNRLQQSAAIQPLVSCPRLVLLNL  540

Query: 541  QGNSLCQEEGIRERLAEMLPSVSSILT  567
            QGNSLCQEEGI+ERLAEMLPSVSSILT
Sbjct: 541  QGNSLCQEEGIQERLAEMLPSVSSILT  567
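
In BLAST protein output, the middle line of each block repeats the residue letter where Query and Sbjct are identical, shows "+" where the substitution scores positively, and is blank otherwise. The "Identities" and "Positives" counts in the header can therefore be tallied from the middle lines. The sketch below is a hedged Python illustration using only the last block of the mouse vs. rat alignment; `midline_counts` is a hypothetical helper name, not a BLAST function.

```python
def midline_counts(midline: str) -> tuple[int, int]:
    """Return (identities, positives) for one BLAST protein middle line."""
    identities = sum(ch.isalpha() for ch in midline)  # letters mark identical residues
    positives = identities + midline.count("+")       # '+' marks conservative changes
    return identities, positives


# Middle line of the final block (mouse/rat residues 541-567).
last_block = "QGNSLCQEEGI+ERLAEMLPSVSSILT"
identities, positives = midline_counts(last_block)
print(f"{identities}/{len(last_block)} identical, {positives}/{len(last_block)} positive")
```

Summing these counts over every block of an alignment would be expected to give the header totals, although character-recognition errors in a scanned middle line can shift the tally by a few residues.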

(b) Mouse vs. Human Amino Acid

sp|Q92696|PGTA_HUMAN  RAB  GERANYLGERANYLTRANSFERASE  ALPHA  SUBUNIT  (RAB 

GERANYL-GERANYLTRANSFERASE  ALPHA  SUBUNIT)  (RAB  GG 
TRANSFERASE)  (RAB  GGTASE) 

Length  = 567 

Score  = 977  bits  (2499),  Expect  = 0.0 

Identities  = 514/567  (91%),  Positives  = 535/567  (94%) 


Query: 1    MHGRLKVKTSEEQAEAKRLEREQKLKLYQSATQAVFQKREAGELDESVLELTSQILGANP  60
            MHGRLKVKTSEEQAEAKRLEREQKLKLYQSATQAVFQKR+AGELDESVLELTSQILGANP
Sbjct: 1    MHGRLKVKTSEEQAEAKRLEREQKLKLYQSATQAVFQKRQAGELDESVLELTSQILGANP  60

Query: 61   DFATLWNCRREVLQQLETQKSPEELAALVKAELGFLESCLRVNPKSYGTWHHRCWLLSRL  120
            DFATLWNCRREVLQQLETQKSPEELAALVKAELGFLESCLRVNPKSYGTWHHRCWLL RL
Sbjct: 61   DFATLWNCRREVLQQLETQKSPEELAALVKAELGFLESCLRVNPKSYGTWHHRCWLLGRL  120

Query: 121  PEPNWARELELCARFLEADERNFHCWDYRRFVAAQAAVAPAEELAFTDSLITRNFSNYSS  180
            PEPNW RELELCARFLE DERNFHCWDYRRFVA+QAAV PAEELAFTDSLITRNFSNYSS
Sbjct: 121  PEPNWTRELELCARFLEVDERNFHCWDYRRFVATQAAVPPAEELAFTDSLITRNFSNYSS  180

Query: 181  WHYRSCLLPQLHPQPDSGPQGRLPENVLLRELELVQNAFFTDPNDQSAWFYHRWLLGRAE  240
            WHYRSCLLPQLHPQPDSGPQGRLPE+VLL+ELELVQNAFFTDPNDQSAWFYHRWLLGRA+
Sbjct: 181  WHYRSCLLPQLHPQPDSGPQGRLPEDVLLKELELVQNAFFTDPNDQSAWFYHRWLLGRAD  240

Query: 241  PHDVLCCLHVSREEACLSVCFSRPLIVGSKMGTLLLTVDEAPLSVEWRTPDGRNRPSHVW  300
            P D L CLHVSR+EACL+V FSRPL+VGS+M  LLL VD++PL VEWRTPDGRNRPSHVW
Sbjct: 241  PQDALRCLHVSRDEACLTVSFSRPLLVGSRMEILLLMVDDSPLIVEWRTPDGRNRPSHVW  300

Query: 301  LCDLPAASLNDHLPQHTFRVIWTGSDTQKECVLLKGHQECWCRDSATDEQLFRCELSVEK  360
            LCDLPAASLND LPQHTFRVIWT  D QKECVLLKG QE WCRDS TDEQLFRCELSVEK
Sbjct: 301  LCDLPAASLNDQLPQHTFRVIWTAGDVQKECVLLKGRQEGWCRDSTTDEQLFRCELSVEK  360

Query: 361  STVLQSELESCKELQELEPENKWCLLTIILLMRALDPLLYEKETLEYFSTLKAVDPMRAA  420
            STVLQSELESCKELQELEPENKWCLLTIILLMRALDPLLYEKE L+YF TLKAVDPMRA
Sbjct: 361  STVLQSELESCKELQELEPENKWCLLTIILLMRALDPLLYEKERLQYFQTLKAVDPMRAT  420

Query: 421  YLDDLRSKFLVENSVLKMEYADVRVLHLAHKDLTVLCHLEQLLLVTHLDLSHNRLRALPP  480
            YLDDLRSKFL+ENSVLKMEYA+VRVLHLAHKDLTVLCHLEQLLLVTHLDLSHNRLR+LPP
Sbjct: 421  YLDDLRSKFLLENSVLKMEYAEVRVLHLAHKDLTVLCHLEQLLLVTHLDLSHNRLRTLPP  480

Query: 481  ALAALRCLEVLQASDNVLENLDGVANLPRLRELLLCNNRLQQSAALQTLASCPRLVFLNL  540
            ALAALRCLEVLQASDN +E+L+GV+NLPRL+ELLLCNNRLQQ A LQ LASCPRLV LNL
Sbjct: 481  ALAALRCLEVLQASDNAIESLEGVTNLPRLQELLLCNNRLQQPAVLQPLASCPRLVLLNL  540

Query: 541  QGNSLCQEEGIRERLAEMLPSVSSILT  567
            QGN LCQ  GI E+LAE+LPSVSS+LT
Sbjct: 541  QGNPLCQAVGILEQLAELLPSVSSVLT  567

(c) Mouse vs. Yeast Amino Acid


sp|Q00618|BET4_YEAST TYPE II PROTEINS GERANYLGERANYLTRANSFERASE ALPHA SUBUNIT
(TYPE II PROTEIN GERANYL-GERANYLTRANSFERASE ALPHA SUBUNIT)
(GGTASE-II-ALPHA) (PGGT) (YPT1/SEC4 PROTEINS
GERANYLGERANYLTRANSFERASE ALPHA SUBUNIT)

Length = 290

Score = 117 bits (291), Expect = 4e-26
Identities = 68/206 (33%), Positives = 103/206 (50%), Gaps = 20/206 (9%)

Query:  49  LELTSQILGANPDFATLWNCRREVLQQLETQKSPEELAALVKAELGFLESCLRVNPKSYG  108 

L+  TS++L  NP+F  +WN  RR+++  L ++  EL  F+  L+  PK  Y 

Sbjct:  12  LKKTSELLEKNPEFNAIWNYRRDIIASLASELE IPFWDKELVFVMMLLKDYPKVYW  67

Query:  109  TWHHRCWLLSRLPEPN WARELELCARFLEADERNFHCWDYRRFVAAQAAVAPA 165 

W+HR  W+L  P + W EL  + + LE  D RN+H  W YRR  V + 

Sbjct:  68  IWNHRLWVLKHYPTSSPKVWQTELAVVNKLLEQDARNYHGWHYRRIVVGNIESITNKSLD  127

Query:  166  -EELAFTDSLITRNFSNYSSWHYRSCLLPQLHPQPDSGPQGRLPENVLLRELELVQNAFF  220 

EE+  +T  IN  SNYS+WH  R ++  -I-+  + + G Q + + E+  + NA  F 

Sbjct:  128  KEEFEYTTIKINNNISNYSAWHQRVQIISRMFQKGEVGNQ KEYIRTEISYIINAMF  183 


Query:  221  TDPNDQSAWFYHRWLLGRAEPHDVLC  246

TD  DQS  WFY  +W  + +D++C

Sbjct:  184  TDAEDQSVWFYIKWFI KNDIVC  205